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TITLE OF THE INVENTION 

A METHOD FOR OPTIMISING GENE EXPRESSION USING 
SYNONYMOUS CODON OPTIMISATION 

FIELD OF THE INVENTION 
[0001] The present invention relates generally to gene expression. More particularly, the 
5 present invention relates to a method for modulating the quality of a selected phenotype that is 
displayed by an organism or part thereof and that results from the expression of a polynucleotide 
that encodes the polypeptide by replacing at least one codon of that polynucleotide with a 
synonymous codon that has a higher or lower preference of usage by the organism or part thereof 
to produce the selected phenotype than the codon it replaces. Even more particularly, the invention 
10 relates to the use of a protein-encoding polynucleotide whose codon composition has been 
modified for modulating the quality of a selected phenotype displayed by an organism or part 
thereof. 

BACKGROUND OF THE INVENTION 

[0002] The expression of foreign heterologous genes in transformed cells is now 
15 commonplace. A large number of mammalian genes, including, for example, murine and human 
genes, have been successfully expressed in various host cells, including bacterial, yeast, insect, 
plant and mammalian host cells. Nevertheless, despite the burgeoning knowledge of expression 
systems and recombinant DNA technology, significant obstacles remain when one attempts to 
express a foreign or synthetic gene in a selected host cell. For example, translation of a synthetic 
20 gene, even when coupled with a strong promoter, often proceeds much more slowly than would be 
expected. The same is frequently true of exogenous genes that are foreign to the host cell. This 
lower than expected translation efficiency is often due to the protein coding regions of the gene 
having a codon usage pattern that does not resemble those of highly expressed genes in the host 
cell. It is known in this regard that codon utilisation is highly biased and varies considerably in 
25 different organisms and that biases in codon usage can alter peptide elongation rates. It is also 
known that codon usage patterns are related to the relative abundance of tRNA isoacceptors, and 
that genes encoding proteins of high versus low abundance show differences in their codon 
preferences. 

[0003] The implications of codon preference phenomena on gene expression are manifest in 
30 that these phenomena can affect the translational efficiency of messenger RNA (mRNA). It is 
widely known in this regard that translation of "rare codons", for which the corresponding iso- 
tRNA is in low abundance relative to other iso-tRNAs, may cause a ribosome to pause during 
translation which can lead to a failure to complete a nascent polypeptide chain and an uncoupling 
of transcription and translation/Thus, the expression of an exogenous gene may be impeded 
35 severely if a particular host cell of an organism or the organism itself has a low abundance of iso- 
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tRNAs coiTcsponding to one or more codons of the exogenous gene. Accordingly, a major aim of 
investigators in this field is to first ascertain the codon preference for particular cells in which an 
exogenous gene is to be expressed, and to subsequently alter the codon composition of that gene 
for optimised expression in those cells* 



translationally inefficient protein coding regions. Traditionally, these techniques have been based 
on the replacement of codons that are rarely or infrequently used in the host cell with those that are 
liQSt-preferred. Codon frequencies can be derived from literature sources for the highly expressed 
genes of many organisms (see, for example, Nakamura el al, 1996, Nucleic Acids Res 24: 214- 
215). These frequencies are generally expressed on an 'organism- wide average basis' as the 
percentage of occasions that a synonymous codon is used to encode a corresponding amino acid 
across a collection of protein-encoding genes of that organism, which are preferably highly 
expressed. 

[0005J Typically, codons are classified as: (a) "common" codons (or "preferred" codons) if 
their frequency of usage is above about 4/3 x the frequency of usage that Would be expected in the 
absence of any bias in codon usage; (b) "rare" codons (or "non-preferred" codons) if their 
frequency of usage is below about 2/3 x the frequency of usage that would be expected in the 
absence of any bias in codon usage; and (c) "intermediate" codons (or "less preferred" codons) if 
their frequency of usage is in-between the frequency of usage of "common" codons and of "rare" 
codons. Since an amino acid can be encoded by 2, 3, 4 or 6 codons, the frequency of usage of any 
selected codon, which would be expected in the absence of any bias in codon usage, will be 
dependent upon the number of synonymous codons which code for the same amino acid as the 
selected codon. Accordingly, for a particular amino acid, the frequency thresholds for classifying 
codons in the "common", "intermediate" and "rare" categories will be dependent upon the number 
of synonymous codons for that amino acid. Consequently, for amino acids having 6 choices of 
synonymous codon, the frequency of codon usage that would be expected in the absence of any 
bias in codon usage is 16% and thus the "common", "intermediate" and "rare" codons are defined 
as those codons that have a frequency of usage above 20%, between 10 and 20% and below 10%, 
respectively- For amino acids having 4 choices of synonymous codon, the frequency of codon 
usage that would be expected in the absence of codon usage bias is 25% and thus the "common", 
"intermediate" and "rare" codons are defined as those codons that have a frequency of usage above 
33%, between 16 and 33% and below 16%, respectively. For isoleucine, which is the only amino 
acid having 3 choices of synonymous codon, the frequency of codon usage that would be expected 
in the absence of any bias in codon usage is 33% and thus the "common", "intermediate" and 
"raTe" codons for isoleucine are defined as those codons that have a frequency of usage above 45%, 
between 20 and 45% and below 20%, respectively. For amino acids having 2 choices of 
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Codon-optimisation techniques are known for improving the translations! kinetics of 
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synonymous codon, the frequency of codon usage that would be expected in the absence of codon 
usage bias is 50% and thus the "common", "intermediate" and "rare" codons are defined as those 
codons that have a frequency of usage above 60%, between 30 and 60% and below 30%, 
respectively. Thus, the categorisation of codons into the "common", "intermediate" and "rare" 
classes (or "preferred", "less preferred" or "non preferred", respectively) has been based 
conventionally on a compilation of codon usage for an organism in general (e.g., 'human-wide*) or 
for a class of organisms in general (e.g., 'mammal-wide*). For example, reference may be made to 
Seed (see U.S. Patent Serial Nos 5,786,464 and 5,795,737) who discloses preferred, less preferred 
and non-preferred codons for mammalian cells in general. However, the present inventor revealed 
in WO 99/02694 and in WO 00/42190 that there are substantial differences in the relative 
abundance of particular iso-tRNAs in different cells or tissues of a single multicellular organism 
(e.g., a mammal or a plant) and that this plays a pivotal role in protein translation from a coding 
sequence with a given codon usage or composition. 

(0006] Thus, in contrast to the art-recognised presumption that different cells of a 
multicellular organism have the same bias in codon usage, it was revealed for the first time that one 
cell type of a multicellular organism uses codons in a manner distinct from another cell type of the 
same organism. In other words, it was revealed that different cells of an organism can exhibit 
different translational efficiencies for the same codon and that it was not possible to predict which 
codons would be preferred, less preferred or non preferred in a selected cell type. Accordingly, it 
was proposed that differences in codon translational efficiency between cell types could be 
exploited, together with codon composition of a gene, to regulate the production of a protein in, or 
to direct that production to, a chosen cell type. 

[0007] Therefore, in order to optimise the expression of a protein-encoding polynucleotide in 
a particular cell type, it is necessary to first determine the translational efficiency for each codon in 
that cell type, rather than to rely on codon frequencies calculated on an organism-wide average 
basis, and then to codon modify the polynucleotide based on that determination. 

(0008] All of the above methods relate to the codon optimisation of protein-encoding 
polynucleotides for modulating the expression of those polynucleotides in a chosen cell type or for 
differentially expressing those polynucleotides between selected cell types. 
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BRIEF SUMMARY OF THE INVENTION 



|O0O9] The present invention relates to a novel strategy for changing the quality of a selected 
phenotype that is displayed by an organism of interest and that results from the expression of a 
polynucleotide that encodes a polypeptide. In contrast to methods heretofore described, this 

5 strategy does not rely on codon usage data or on codon transitional efficiency data that may be 
applicable to the organism of interest. Instead, it relies on ranking individual synonymous codons 
that code for an amino acid in the polypeptide according to their preference of usage by the 
organism of interest for producing the selected phenotype. In other words, the subject method is 
based on determining the "phenotypic preferences" of individual synonymous codons. 

10 Advantageously, phenotypic preferences can be determined by introducing into the organism of 
interest or into a related organism a synthetic construct that comprises a regulatory polynucleotide 
operably linked to a tandem repeat of a codon fused in frame with a reporter polynucleotide that 
encodes a protein, which produces, or which is predicted to produce, the selected phenotype or a 
phenotype of the same class as the selected phenotype. The quality of the phenotype produced by 

15 the expression of the synthetic construct in the organism of interest or in the Telated organism is 
then determined using a suitable assay. The selected phenotype may be a therapeutic or 
prophylactic phenotype including immunity, tolerance, pathogen resistance etc, or other beneficial 
trait including enhancement or prevention of a repair process, pest resistance, frost resistance, 
herbicide tolerance etc. Li accordance with the present invention, a set of synonymous codons will 

20 often display a range of phenotypic preferences, which can be used as a basis for rationally 
selecting a codon in the original polynucleotide for replacement with a synonymous codon that has 
a different phenotypic preference. 

[0010) Thus, in one aspect of the present invention, methods are provided for constructing a 
synthetic polynucleotide from which a polypeptide is producible to confer a selected phenotype 

25 upon an organism of interest or part thereof in a different quality than that conferred by a parent 
polynucleotide that encodes the same polypeptide. These methods generally comprise: (a) 
selecting a first codon of the parent polynucleotide for replacement with a synonymous codon, 
wherein the synonymous codon is selected on the basis that it exhibits a different phenotypic 
preference than the first codon in a comparison of phenotypic preferences in test organisms or parts 

30 thereof, wherein the test organisms are selected from the group consisting of organisms of the same 
species as the organism of interest and organisms that are related to the organism of interest; and 
(b) replacing the first codon with the synonymous codon to construct the synthetic polynucleotide 

(0011) The phenotypic preferences of codons in the test organisms or parts are suitably 
compared by: (i) separately introducing into the test organisms or parts individual synthetic 

35 constructs, each of which comprises a regulatory polynucleotide operably linked to a tandem repeat 
of a codon fused in frame with a reporter polynucleotide that encodes a reporter protein, which 
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produces, or which is predicted to produce, a corresponding phenotype selected from the group 
consisting of the selected phenotype and a phenotype of the same class as the selected phenotype; 
and (ii) comparing the quality of the phenotypes displayed by the test organisms or parts to 
determine the relative phenotypic preferences of the codons. 

5 (0012] The synthetic constructs are typically introduced into the test organisms or parts using 
the same or similar mode of introduction. This is desirable when the corresponding phenotype or its 
quality is dependent on a particular mode or site of introduction of the synthetic constructs. 

[0013J In some embodiments, the tandem repeat of each of the synthetic constructs comprises 
at least three copies of the corresponding codon. 

10 [0014] In some embodiments, the synonymous codon is selected such that it has a higher 
phenotypic preference than the first codon. In accordance with the present invention, a higher 
phenotypic preference will correlate with a higher quality of the selected phenotype. Accordingly, a 
synonymous codon for these embodiments is preferably selectable when the quality of the 
phenotype conferred by the synthetic construct comprising a tandem repeat of the synonymous 

15 codon is suitably at least about 5% higher than the quality of the phenotype conferred by the 
synthetic construct comprising a tandem repeat of the first codon. 

[0015] In other embodiments, the synonymous codon is selected such that it has a lower 
phenotypic preference than the first codon. According to the subject invention, a lower phenotypic 
preference will correlate with a lower quality of the selected phenotype. Thus, a synonymous codon 
20 for these embodiments is preferably selectable when the quality of the phenotype conferred by the 
synthetic construct comprising a tandem repeat of the synonymous codon is suitably at least about 
5% lower than the quality of the phenotype conferred by the synthetic construct comprising a 
tandem repeat of the first codon. 

[0016] Although the present invention extends to unicellular organisms, it is preferably 
25 applicable to multicellular organisms including plants and animals. 

[0017] In another aspect, the invention provides methods for determining the phenotypic 
preference of a first codon in an organism of interest or part thereof. These methods generally 
comprise: (a) introducing a synthetic construct into a test organism or part thereof, wherein the test 
1 organism is selected from the group consisting of an organism of the same species as the organism 

30 of interest and an organism that is related to the organism of interest, the synthetic construct 
comprising a regulatory polynucleotide operably linked to a tandem repeat of the first codon fused 
in frame with a reporter polynucleotide that encodes a reporter protein, which produces, or which is 
predicted to produce, a selected phenotype or a phenotype of the same class as the selected 
phenotype; and (b) determining the quality of the corresponding phenotype displayed by the 
35 organism or part. 
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corresponding phenotype displayed by a test organism or part to which a synthetic construct 



corresponding phenotype displayed by a test organism or part to which a synthetic construct 
5 comprising a tandem repeat of a second codon was provided, wherein the second codon encodes 
the same amino acid as the first codon, to thereby determine the phenotypic preference of the first 
codon relative to the phenotypic preference of the second codon in the test organism or part. 



construct into a progenitor of a test organism or part; and (2) producing the test organism or part 
10 from the progenitor, wherein the test organism or part contains the synthetic construct. 

(0020] In other embodiments, the methods further comprise; (1) introducing the synthetic 
construct into a progenitor of a test organism or part; and (2) growing the test organism or part 
from the progenitor; wherein the test organism or part comprises a cell containing the synthetic 
construct. 

15 [0021] In still other embodiments, the methods further comprise: introducing the synthetic 
construct into a selected cell of the test organism or part. 

10022] In yet another aspect, the invention provides a synthetic polynucleotide constructed 
according to any one of the above methods. 

[0023] In even yet another aspect, the invention encompasses an organism of interest or part 
20 thereof containing a synthetic polynucleotide constructed according to any one of the above 
methods. 

[0024] In still another aspect, the invention contemplates an organism of interest or part 
thereof containing a synthetic construct that comprises a regulatory polynucleotide operably linked 
to a tandem repeat of a first codon fused in frame with a reporter polynucleotide that encodes a 
25 reporter protein, which produces, or which is predicted to produce, a selected phenotype or a 
phenotype of the same class as the selected phenotype. 

[0025] In a further aspect, the invention embraces methods of modulating the quality of a 
selected phenotype that is displayed by an organism of interest or part thereof and that results from 
the expression of a parent polynucleotide that encodes the polypeptide. These methods generally 
30 comprise: introducing into the organism or part a synthetic polynucleotide that is distinguished 
from the parent polynucleotide by the replacement of a first codon in the parent polynucleotide 
with a synonymous codon that exhibits a different phenotypic preference than the first codon in the 
organism or part. 

[0026] In yet a further aspect, the invention encompasses methods of enhancing the quality of 
35 a selected phenotype that is displayed by an organism of interest or part thereof and that results 



comprising a tandem repeat of the first codon was provided; and (ii) the quality of the 



[0019] 



In some embodiments, the methods further comprise: (1) introducing the synthetic 
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from the expression of a parent polynucleotide that encodes the polypeptide. These methods 
generally comprise: introducing into the organism or part a synthetic polynucleotide that is 
distinguished from the parent polynucleotide by the replacement of a first codon in the parent 
polynucleotide with a synonymous codon that exhibits a higher phenotypic preference than the first 
5 codon in the organism or part. 

[0027} In even yet a further aspect, the invention extends to methods of reducing the quality 
of a selected phenotype that is displayed by an organism of interest or part thereof and that results 
from the expression of a parent polynucleotide that encodes the polypeptide. These methods 
generally comprise: introducing into the organism or part a synthetic polynucleotide that is 
10 distinguished from the parent polynucleotide by the replacement of a first codon in the parent 
polynucleotide with a synonymous codon that exhibits a lower phenotypic preference than the first 
codon in the organism or part. 
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DETAILED DESCRIPTION OF THE INVENTION 

h Definitions 

[0028} Unless defined otherwise, all technical and scientific terms used herein have the same 
meaning as commonly understood by those of ordinary skill in the art to which the invention 

5 belongs. Although any methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the present invention, preferred methods and materials are 
described. For the purposes of the present invention, ihe following terms are defined below. 
[0029] The articles "a" and ' W are used herein to refer to one or to more than one (i.e., to at 
least one) of the grammatical object of the article. By way of example, "an element" means one 

10 element or more than one element. 

(0030] The term "about" is used herein to refer to parameters (e.g., amounts, concentrations, 
phenotype-conferring efficiencies, time etc) that vary by as much as 30%, 20%, 15%, 10% , 5% or 
even 4%, 3%, 2%, 1% to a specified parameter. 

[0031] As used herein, the term "cis~acting sequence" or "cis-regtriatoy region" or similar 
15 term shall be taken to mean any sequence of nucleotides which is derived from an expressible 
genetic sequence wherein the expression of the genetic sequence is regulated, at least in part, by 
said sequence of nucleotides. Those skilled in the art will be aware that a cis-regulatory region may 
be capable of activating, silencing, enhancing, repressing or otherwise altering the level of 
expression and/or cell-typc-specificity and/or developmental specificity of any structural gene 
20 sequence. 

[0032] Throughout this specification, unless the context requires otherwise, the v/ords 
"comprise", "comprises" and "comprising" will be understood to imply the inclusion of a stated 
step or element or group of steps or elements but not the exclusion of any other step or element or 
group of steps or elements. 
25 [0033] As used herein a "conferred phenotype" refers to a temporary or permanent change in 
the state of an organism of interest or class of organisms of interest, or of a part or tissue or cell or 
cell type or class of cell of an organism of interest, which occurs after the introduction of a 
polynucleotide to that organism, or to that class of organisms, or to the part or tissue or cell or cell 
type or class of cell, or to a precursor of that organism or part or tissue or cell or cell type or class 

30 of cell, and which would not have occurred in the absence of that introduction. Typically, such a 
temporary or permanent change occurs as a result of the transcription and/or translation of genetic 
information contained within that polynucleotide in the cell, or in at least one cell or cell type or 
class of cell within the organism of interest or within the class of class of organisms of interest, and 
can be used to distinguish the organism of interest, or class of organisms of interest, or part or 

35 tissue or cell or cell type or class of cell thereof, or genetic progeny of these, to which the 
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polynucleotide has been provided from a similar organism of interest, or class of organisms of 
interest, or part or tissue or cell or cell type or class of cell thereof, or genetic progeny of these, to 
which the polynucleotide has not been provided. 

(0034] By "expression vector" is meant any autonomous genetic element capable of directing 
the synthesis of a protein encoded by the vector. Such expression vectors are known by 
practitioners in the art. 

[0035] The term "gene" as used herein refers to any and all discrete coding regions of a host 
genome, or regions that code for a functional RNA only (e.g., tRNA, rRNA, regulatory RNAs such 
as ribozymes, post translational gene silencing-associated RNAs etc) as well as associated non- 
coding regions and optionally regulatory regions. In certain embodiments, the term "gene" gene 
includes within its scope the open reading frame encoding specific polypeptides, introns, and 
adjacent 5' and 3' non-coding nucleotide sequences involved in the regulation of expression. In this 
regard, the gene may further comprise control signals such as promoters, enhancers, termination 
and/or polyadenylation signals that are naturally associated with a given gene, or heterologous 
control signals. The gene sequences may be cDNA or genomic DNA or a fragment thereof. The 
gene may be introduced into an appropriate vector for extrachromosomal maintenance or for 
integration into the host. 

[0036] By "modulating", "modulate" and the like is meant increasing or decreasing, either 
directly or indirectly, the quality of a selected phenotype. 

[0037) By "natural gene" is meant a gene that naturally encodes the protein. However, it is 
possible that the parent polynucleotide encodes a protein that is not naturally-occurring but has 
been engineered using recombinant techniques. 

10038] The term "5 ' non-coding region" is used herein in its broadest context to include all 
nucleotide sequences which are derived from the upstream region of an expressible gene, other 
than those sequences which encode amino acid residues which comprise the polypeptide product of 
said gene, wherein 5' non-coding region confers or activates or otherwise facilitates, at least in part, 
expression of the gene. 

[0039J The term "oligonucleotide" as used herein refers to a polymer composed of a 
multiplicity of nucleotide units (deoxyribonucleotides or ribonucleotides, or related structural 
variants or synthetic analogues thereof) linked via phosphodiester bonds (or related structural 
variants or synthetic analogues (hereof). Thus, while the term "oligonucleotide" typically refers to a 
nucleotide polymer in which the nucleotides and linkages between them are naturally occumng, it 
will be understood that the term also includes withm its scope various analogues including, but not 
restricted to, peptide nucleic acids (PNAs), phosphoramidates, phosphorothioates, methyl 
phosphonates, 2-O-methyl ribonucleic acids, and the like. The exact size of the molecule may vary 
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depending on the particular application. An oligonucleotide is typically rather short in length, 
generally from about 10 to 30 nucleotides, but the term can refer to molecules of any length, 
although the term "polynucleotide" or "nucleic acid" is typically used for large oligonucleotides. 

(0040] The term "operably connected 11 or "operably linked* as used herein means placing a 
structural gene under the regulatory control of a promoter, which then controls the transcription and 
optionally translation of the gene. In the construction of heterologous promoter/structural gene 
combinations, it is generally preferred to position the genetic sequence or promoter at a distance 
from the gene transcription start site that is approximately the same as the distance between that 
genetic sequence or promoter and the gene it controls in its natural setting; i.e. the gene from which 
the genetic sequence or promoter is derived. As is known in the art, some variation in this distance 
can be accommodated without loss of function. Similarly, the preferred positioning of a regulatory 
sequence element with respect to a heterologous gene to be placed under its control is defined by 
the positioning of the element in its natural setting; i.e. the genes from which it is derived. 

[0041] The terms "precursor" and "progenitor" as used herein refer to a cell or part of 
organism that can gives rise to an organism of interest in which phenotypic expression is desired or 
in which phenotypic preference of a codon is to be determined. 

(0042) The term "phenotype" means any one or more detectable physical or functional 
characteristics, properties, attributes or traits of an organism, tissue, or cell, or class of organisms, 
tissues or cells, which generally result from the interaction between the genetic makeup (i.e., 
genotype) of the organism, tissue, or cell, or the class of organisms, tissues or cells and the 
environment. In certain embodiments, the term ''phenotype" excludes resistance to a selective agent 
or screening an enzymic or light-emitting activity, conferred directly by a reporter protein. 

[0043] By "phenotypic preference" is meant the preference with which an organism uses a 
codon to produce a selected phenotype. This preference can be evidenced, for example, by the 
quality of a selected phenotype that is producible by a polynucleotide that comprises the codon in 
an open reading frame which codes for a polypeptide that produces the selected phenotype. In 
certain embodiment, the preference of usage is independent of the route by which the 
polynucleotide is introduced into the organism. However, in other embodiments, the preference of 
usage is dependent on the route of introduction of the polynucleotide into the organism. 

[0044] The term "polynucleotide" or "nucleic acid" ' as used herein designates mRNA, RNA, 
cRNA, cDNA or DNA. The term typically refers to oligonucleotides greater than 30 nucleotides in 
length. 

[0045] "Polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a 
polymer of amino acid residues and to variants and synthetic analogues of the same. Thus, these 
terms apply to amino acid polymers in which one or more amino acid residues is a synthetic non- 
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naturally occurring amino acid, such as a chemical analogue of a corresponding naturally occurring 
amino acid, as well as to naturally-occurring amino acid polymers. 

[00461 By "producing", and like terms such as "production 1 and "producible"* in the context 
or protein production, is meant production of a protein to a level sufficient to achieve a particular 
function or phenotype associated with the protein. By contrast, the terms "not producible'* and "not 
substantially producible'* as used interchangeably herein refer to (a) no production of a protein, (b) 
production of a protein to a level that is not sufficient to effect a particular function or phenotype 
associated with the protein, (c) production of a protein, which cannot be detected by a monoclonal 
antibody specific for the protein, or (d) production of a protein, which is less that 1% of the level 
produced in a wild-type cell that normally produces the protein. 

[0047] Reference herein to a "promoter'" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA box which is 
required for accurate transcription initiation, with or without a CCAAT box sequence and 
additional regulatory elements (i.e. upstream activating sequences, enhancers and silencers) which 
alter gene expression in response to developmental and/or environmental stimuli, or in a tissue- 
specific or cell-type-specific manner. A promoter is usually, but not necessarily, positioned 
upstream or 5\ of a structural gene, the expression of which it regulates. Furthermore, the 
regulatory elements comprising a promoter are usually positioned within 2 kb of the start site of 
transcription of the gene. Preferred promoters according to the invention may contain additional 
copies of one or more specific regulatory elements to further enhance expression in a cell, and/or to 
alter the timing of expression of a structural gene to which it is operably connected. 

(00481 The term "quality'* is used herein in its broadest sense and includes a measure, 
strength, intensity, degree or grade of a phenotype, e.g., a superior or inferior immune response, 
increased or decreased disease resistance, higher or lower sucrose accumulation, better or worse 
salt tolerance etc. 

10049] By "recombinant polypeptide" is meant a polypeptide made using recombinant 
techniques, i.e., through the expression of a recombinant or synthetic polynucleotide. 

[00501 The term "synthetic polynucleotide" as used herein refers to a polynucleotide formed 
in vitro by the manipulation of a polynucleotide into a form not normally found in nature. For 
example, the synthetic polynucleotide can be in the form of an expression vector. Generally, such 
expression vectors include transcriptional and translational regulatory polynucleotide operably 
linked to the polynucleotide. 

[0051] The term "synonymous codon 11 as used herein refers to a codon having a different 
nucleotide sequence than another codon but encoding the same amino acid as that other codon. 

[0052] By "vector" is meant a nucleic acid molecule, preferably a DNA molecule derived, for 
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example, from a plasmid, bacteriophage, or plant virus, into which a nucleic acid sequence may be 
inserted or cloned. A vector preferably contains one or more unique restriction sites and may be 
capable of autonomous replication in a defined host cell including a target cell or tissue or a 
progenitor cell or tissue thereof, or be integrable with the genome of the defined host such that the 
5 cloned sequence is reproducible. Accordingly, the vector may be an autonomously replicating 
vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g., a linear or closed circular plasmid, an 
extrachromosomal element, a minichromosome, or an artificial chromosome. The vector may 
contain any means for assuring self-replication. Alternatively, the vector may be one which, when 

10 introduced into the host cell, is integrated into the genome and replicated together with the 
' chromosome(s) into which it has been integrated. A vector system may comprise a single vector or 
plasmid, two or more vectors or plasmids, which together contain the total DNA to be introduced 
into the genome of the host cell, or a transposon. The choice of the vector will typically depend on 
the compatibility of the vector with the host Cell into which the vector is to be introduced. The 

15 vector may also include a selection marker such as an antibiotic resistance gene that can be used for 
selection of suitable transformants. Examples of such resistance genes are well known to those of 
skill in the art. 

2* Phenotypic preference of codons 

[0053] The present invention provides a novel strategy for enhancing or reducing the quality 
20 of a selected phenotype that is displayed, or proposed to be displayed, by an organism of interest] 
The strategy involves codon modification of a polynucleotide that encodes a phenotype-associated 
polypeptide that either by itself or in association with other molecules, in the organism of interest 
imparts or confers the selected phenotype upon the organism. Unlike previous methods, however, 
the strategy does not rely on data that provide a ranking of synonymous codons according to their 
25 preference of usage in an organism or class of organisms. Nor does it rely on data that provide a 
ranking of synonymous codons according to their translational efficiencies in one or more cells of 
the organism or class of organisms. Instead, it relies on ranking individual synonymous codons that 
code for an amino acid in the phenotype-associated polypeptide according to their preference of 
usage by the organism or class of organisms, or by a part thereof, for producing the selected 
30 phenotype. As used herein, the preference of usage of an individual codon by an organism or class 
of organisms or by a part thereof is referred to as a "phenotypic preference". Such preference may 
be determined by qualitatively or quantitatively determining the influence of the individual codon 
on a achieving the selected phenotype and it does not require, therefore, the calculation of codon 
frequencies or translational efficiencies as heretofore described. 

35 [0054J Advantageously, phenotypic preferences for synonymous codons can be determined 
by introducing into the organism of interest or into a related organism, or into a part of such 
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organisms a synthetic construct that comprises a regulatory polynucleotide operably linked to a 
tandem repeat of a codon fused in frame with a reporter polynucleotide that encodes a reporter 
protein, which produces, or which is predicted to produce, the selected phenotype or a phenotype of 
the same class as the selected phenotype. In one embodiment, the reporter protein is a phenotype- 
associated polypeptide (e.g., a melanoma specific antigen such as BAGE or GAGE-1) that will be 
the subject of producing the selected phenotype (e.g., immunity to melanoma). In another 
embodiment, the phenotype-associated polypeptide (e.g., green fluorescent protein or a 
gastrointestinal associated antigen such as 17-1 A) is incapable of producing the selected phenotype 
(e.g., immunity to melanoma) but produces the same class of phenotype (e.g., an immune response) 
as the selected phenotype. In illustrative examples, the phenotype-associated polypeptide is 
selected from antigens including antigens from pathogenic organisms or cancers (e.g., wherein the 
phenotype is immunity to disease) and self antigens or transplantation antigens (e.g., wherein the 
phenotype is antigen-specific anergy or tolerance), growth factors (e.g., wherein the phenotype is 
selected from size of the organism or part, wound healing, cell proliferation, cell differentiation, 
cell migration, immune cell function), hormones (e.g., wherein the phenotype is increased lactation, 
e.g., using oxytocin, or amelioration of a diabetic state, e.g., using insulin) and toxins (e.g., wherein 
the phenotype is tumour regression or cell death). 

(0055) The design of the synthetic construct is preferably based on the synthetic construct 
design of Frazer et al in International Publication WO 00/42215, which is predicated on the 
determination that different but synonymous stretches of identical codons fused respectively in 
frame with a reporter polynucleotide can give rise to different levels of reporter protein produced 
within a given cell type. The level of reporter protein produced in a selected cell type is sensitive to 
the intracellular abundance of the iso-tRNA species corresponding to the identical codons of the 
construct and, therefore, provide a direct correlation of a cell's preference for translating a given 
codon. However, in contrast to WO 00/42215, which is primarily concerned with determining 
translational efficiencies of synonymous codons in selected cell types, the present invention is 
concerned with using synthetic constructs to determine the influence of their synonymous stretches 
of identical codons on the phenotype or class of phenotype displayed by the organism or part in 
response to the phenotype-associated protein produced by those synthetic constructs. This means, 
for example, that if the quality of the phenotype displayed by a selected organism or part thereof to 
which a synthetic construct having a tandem series of identical first codons is provided is lower 
than the quality of the phenotype displayed by the selected organism or part to which a synthetic 
construct having a tandem series of identical second codons is provided (i.e., wherein the first 
codons are different than, but synonymous with, the second codons), then it can be deduced that the 
selected organism or part has a higher preference for the second codon than the first codon with 
respect to the quality of the phenotype produced. Put another way, the second codon has a higher 
phenotypic preference than the first codon in the selected organism or part. 
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(0056) Suitably, the tandem repeat comprises at least three identical codons, preferably 
between four and seven identical codons and more preferably five or six identical codons, 

(0057J The tandem repeat can be fused at a location adjacent to, or within, the reporter 
polynucleotide. Tins location is preferably selected such that the tandem repeat interferes with 
5 translation of at least a portion of the phcnotype-associated protein. Preferably, the tandem repeat is 
located immediately upstream (translationally) from the reporter polynucleotide. 

(0058] It is of course possible that a tandem repeat of identical amino acid residues (e.g., an 
oligo-proline repeat) can render the phenotype-associated protein unstable. Typically, protein 
instability is detected when the phenorype conferred by expression of the reporter polynucleotide is 
10 not detectable with any choice of isoaccepting codon specific for the amino acid corresponding to 
the tandem repeat. However, such protein instability can be alleviated by use of at least one spacer 
codon within the tandem repeat of identical codons, wherein the spacer codon encodes a neutral 
amino acid. 

[0059] The spacer codon(s) can be placed adjacent to, or interposed between, some or all of 
15 the identical codons corresponding to the tandem repeat. For example, a suitable interposition for a 
pen to-repeat of identical codons can be selected from the group consisting of: (a) I-S-I-S-I-S-I-S-I- 
S; (b) S-I-S-I-SJ-S-I-S-I; (c) I-S-I-S-M-S-I; (d) I-S-I-I-S-I-S-I; (e) I-S-I-S-I-I-I; (0 M-S-I^-I-I; 
(g) I-H-S-I-S-I; (h) I-S-I-l-S-M; (i) M-S-M-S-I; (j) I-M-H-S-I; (k) tS-M-M; (1) H-S-H-I; (m) 
I-I-I-S-M; and (n) M-I-I-S-I, wherein I corresponds to an identical codon of a tandem repeat and S 
20 corresponds to a spacer codon. 

[0060] Preferably, a spacer codon is efficiently translated in a cell type of the organism or the 
organism itself relative to other synonymous codons. This is important so that translation of the 
spacer codon is not rate limiting. The neutral amino acid includes, but is not restricted to, alanine 
and glycine. 

25 10061] The reporter polynucleotide can encode any suitable protein that confers upon the 
organism or part, either by itself or in association with other molecules, a phenotype or class of 
phenorype. In specific embodiments, the selected phenotype or class of phenotype corresponds to a 
beneficial or improved or superior state or condition of the organism or part thereof relative to a 
reference state or condition. In illustrative examples, the reference state or condition corresponds to 

30 a pathophysiological state. Phenotypes contemplated by the present invention include any desirable 
beneficial trait including, but not restricted to: immunity (e.g., immunity to pathogenic infection or 
cancer); antigen tolerance (e.g., antigen-specific T lymphocyte anergy, tolerance to allergens, 
transplantation antigens and self antigens); angiogenesis (e.g., blood vessel formation in the heart 
and vasculature and in tumour growths); anti-angiogenesis (e.g., treatment of ischaemic heart 

35 disease and tumours); amelioration of clinical symptoms (e.g., fever; inflammation; encephalitis; 

weight loss; anaemia; sensory symptoms such as paresthesia or hypaesthesia; ataxia; neuralgia; 
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paralysis; vertigo; urinary or bowel movement abnormalities; and cognitive dysfunction such 
memoty loss, impaired attention, p^blem-solving difficulties, slowed information processing, and 
difficulty in shifting between cognitive tasks); reduced or increased ceil death (e.g., apoptosis); 
reduced or increased cell differentiation; reduced or increased cell proliferation; tumour or cancer 
5 regression; growth and repair of tissue or organ; decreased fibrosis; inhibition or reversal of cell 
senescence; increased or reduced cell migration; differential expression of protein between 
different cells or tissues of an organism or part thereof; trauma recovery; recovery from burns, 
antibiotic resistance or sensitivity (e.g., resistance or sensitivity to aminoglycoside antibiotics such 
as geneticin and paromomycin); herbicide tolerance or sensitivity (e.g. tolerance or sensitivity to 
10 glyphosate or glufosinate); starch biosynthesis or modification (e.g. using a starch branching 
enzyme, starch synthases, ADP-glucose pyrophosphorylase); fatty acid biosynthesis (e.g. using a 
desatwase or hydroxylase); disease resistance or tolerance (e.g., resistance to animal diseases such 
as cardiovascular disease, autoimmunity, Alzheimer's disease, Parkinson's disease, diabetes, AIDS 
etc or resistance to plant diseases such as rust, dwarfism, rot, smut, mould, scab and mildew); pest 
15 resistance or tolerance including insect resistance or tolerance (e.g., resistance to borers and 
worms); viral resistance or tolerance (e.g. resistance to animal viruses such as herpesviruses, 
hepadnavimses, adenoviruses, flaviviruses, lentivrruses, poxviruses etc or resistance to plant 
viruses such as badnaviruses, caulimoviruses, potyviruses, luteoviruses, rhabdoviruses etc); fungal 
resistance or tolerance (e.g., resistance to arbuscular mycorrhizal fungi, endophytic fungi etc); a 
20 metabolic trait including sucrose metabolism (e.g., sucrose isomerisation); frost resistance or 
tolerance; stress tolerance (e.g., salt tolerance, drought tolerance); and improved food content or 
increased yields. Persons of skill in the art will recognise that the above exemplary classes of 
phenotype may be subdivided into phenotypic subclasses and that such subclasses would also fall 
within the scope of phenotypic classes contemplated by the present invention. For example, 
25 subclasses of immunity include innate immunity (which can be further subdivided inter alia into 
complement system, monocytes, macrophages, neutrophils and natural killer cells), cellular 
immunity (which can be further subdivided inter alia into cytolytic T lymphocytes, dendritic cells 
and T helper lymphocytes) and humoral immunity (which can be further subdivided inter alia into 
antibody subclasses IgA* IgD, IgE, IgG and IgM). 

30 [0062] The instant method is applicable to prokaryotic as well as eukaryotic hosts and 1 
includes unicellular organisms and especially to multicellular organisms, such as but not limited to 
yeast, plants and animals including vertebrate animals such as mammals, reptiles, fish, birds etc as 
well as invertebrate animals such as metazoa, sponges, worms, molluscs, nematodes, crustaceans, 
echinoderms etc. In certain embodiments of the present invention, the organism of interest is 

35 selected from plants and mammals. 

[0063] Illustrative examples of eukaryotic organisms include, but are not limited to, fungi 
such as yeast and filamentous fungi, including species of Aspergillus, Trichoderma, and 
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Neurospora\ animal hosts including vertebrate animals illustrative examples of which include fish 
(e.g., salmon, trout, tulapia, tuna, carp, flounder, halobut, swordfish, cod and zebrafish), birds (e.g., 
chickens, ducks, quail, pheasants and turkeys, and other jungle foul or game birds) and mammals 
(e.g., dogs, cats, horses, cows, buffalo, deer, sheep, rabbits, rodents such as mice, rats, hamsters and 

5 guinea pigs, goats, pigs, primates, marine mammals including dolphins and whales, as well as cell 
lines, such as human or other mammalian cell lines of any tissue or stem cell type (e.g., COS, NIH 
3T3 CHO, BHK, 293, or HeLa cells), and stem cells, including pluripotent and non-pluripotent and 
embryonic stem cells, and non-human zygotes), as well as invertebrate animals illustrative 
examples of which include nematodes (representative generae of which include those that infect 

10 animals such as but not limited to Ancylostoma, Ascaridia, Ascaris, Bwwstomum, Caenorhabditis, 
Capillaria, Chabertia, Cooperia, Dictyocaulus, Haemonchus, Heterakis, Nematodes, 
Oesophagostomum, Ostevtagia, Oxyuris, Parascaris, Stron$>fas, Toxascaris, Trichuris, 
Trichostrongylus, Tflichonema, Toxocara, UndnOiia, and those that infect plants such as but not 
limited to Bursaphalenchus, Criconerriella, Diiylenchus.Ditylenchus, Globodera, Helicotylenchus , 

15 Heterodera, Longidorus, Melodoigyne, Nacobbus, Paratylenchus, Pratylenchus, Radopholus, 
Rotelynchus, Tyletichus, and JSphinerna) and other worms, drosophila, and other insects (such as 
from the families Apidae, Curculionidae, Scarabaeidae, Tephritidae, Tortricidae, amongst others, 
representative orders of which include Coleoptera, Diptcra, Lepidoptera, and Homoptera. 

[0064] In certain embodiments, the organism of interest is a plant which is suitably selected 
20 from monocotyledons, dicotyledons and gymnosperms. The plant may be an ornamental plant or 
crop plant. Illustrative examples of ornamental plants include, but are not limited to, Malus spp, 
Crataegus spp, Rosa spp., Betula spp, Sorbus spp, Olea spp, Nerium spp, Salix spp, Populus spp. 
Illustrative examples of crop plants include plant species which are cultivated in order to produce a 
harvestable product such as, but not limited to, Abetmoschns esculentus (okja), Acacia spp., Agave 
25 . fourcroydes (henequen), Agave sisalana (sisal), Albizia spp., Allium fistuloswn (bunching onion), 
Allium sativum (garlic), Allium spp. (onions), Alpinia galcinga (greater galanga), Amaranthus 
caudatus, Amaranthus spp., Anacardium spp. (cashew), Ananas comosus (pineapple), Anethum 
graveolens (dill), Annona clterimola (cherimoya), Apios americana (American potatobean), 
Arachis hypogaea (peanut), Arctium spp. (burdock), Artemisia spp. (wormwood), Aspalathus 
30 linearis (redbush tea), Athertonia diversifolia, Airiplex nummularia (old man saltbush), Averrhoa 
carambola (starfhrit), Aiadirachia indica (neem), Backfwusia spp., Bambusa spp. (bamboo), Beta 
vtdgaris (sugar beet), Boehmeria nivea (ramie), bok choy 7 Boronia megastigma (sweet boronia), 
Brassica carinata (Abyssinian mustard), Brassica juncea (Indian mustard), Brassica napus 
(rapeseed), Brassica oleracea (cabbage, broccoli), Brassica oleracea var Albogabra (gai lum), 
35 Brassica parachinensis (choi sum), Brassica pekensis (Wong bok or Chinese cabbage), Brassica 
spp., Burcella obovata, Cajanus cajan (pigeon pea), Camellia sinensis (tea), Cannabis sativa (non- 
drug hemp), Capsicum spp., Carica spp. (papaya), Carthamus tinctorius (safflower), Carum carvi 
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(caraway), Cassinia spp., Castanospermwn austral* (blackbean), Casuarina cunninghamiana 
(bcefwood), Ceratonia sitiqua (carob), Chamaemelum nobile (chamomile), Chamelaucium spp. 
(Geraldton wax), Chenopodium guinea (quinoa), Chrysanthemum (Tanacetum), 
(pyrethrum), Qcer arietinum (chickpea), Oc/h>™,« Omatt -pp., CMa-A-f 

5 /bnmf (Stuffs desert pea), Cta »cjM (coconut), Cotfea spp. (coffee), Colocasia esculenta 
(taro), Mn»» (coriander), Crambe abyssinica (crambe), Ot>*« sativus (saffron), 

C UC «rWto/^to»«o (buffalo gourd), Cucurbita spp. (gourd), Qmiqp* tetragonobba (guar), 
Cym&W>» spp. (lemongrass), Qrinf proliferus (tagasaste), Zto emtt (carrot), 
Desmanihus spp., ZJtocorea «c*toi/a (Asiatic yam), Dioscorea spp. (yams), Z>iaw»w spp. 
10 (persimmon), ltoMfcn sp., Echinacea spp., £bM*-rir *b* (water chestnut), Elevsine 
coracana (finger millet), Auirihtf anmdinaceus, Eragrostis tef (tef), ^tortus «n<»<Wet*, 
Eriobotrvajaponica (loquat), £«a./vpn« spp., ftcoftptaf s PP . (gil mallee), £«ctea spp., &««tto 
Mbcewb (jumba), ApfarMi spp., Euphoria longana (longan), Am» wasabi (wasabi), 
Fago^rwm es^/en/um (buckwheat), Festuca arundinacea (tall fescue), F/citf spp. (fig), 
15 Flacourtia biennis, Flmdersia grayliana (Queensland maple), Foeniculum olearia, Foeniculum 
vulgare (fennel), Garcinia mangostana (mangosteen), Glycine latifolia, Glycine max (soybean), 
Glycine max (vegetable soybean), Glycyrrhiza glabra (licorice), Gow^/uin spp. (cottons), 
Grevillea spp., GnWe/te spp., Guizotia abyssinica (niger), Harpagophyllum sp., tfe/ian//™ 
a»««w (high oleic sunflowers), Mianthus annuus (monosun sunflowers), /fe/ian//»tf tuberosus 
20 (Jerusalem artichoke), iflta cannabis (kenaf), /fonta /fonfe™ spp. (waxy 

barley), Hordeum vulgare (barley), Hordeum vulgare subsp. spontaneum, Humulus lupulus (hops), 
/Zyrfrarfis eai»«fe»sis (golden seal), Hymenachne spp., //^opus officinalis (hyssop), /n^o/em 
spp., /nga edulis (ice cream bean), i«oca/yw tugiter, Ipomoea batatas (sweet potato), Ipomoea sp. 
(kang kong), LaMao purpureus (white lablab), lactoca spp. (lettuce), Lathyrus spp. (vetch), 
25 Lavandula spp. (lavender), ^ spp. (lentil), Lesquerella spp. (bladderpod), W»e«a spp., Lilium 
spp., Limnanthes spp. (meadowfoam), JUmmi usitatissimum (flax), £im»» vsitatissimum (linseed), 
Z,m«m usitatissimum (Linola.TM.), IflcW c/if»e;»w (lychee), lo/ttf comiculatus (birdsfoot trefoil), 
Icto pedunculate, Lotus sp., Luffa spp., I«»<ma omm (honesty), Lupinus mutabilis (pearl 
lupin), Lupinus spp. (lupin), Macatoa spp, Mangifera indica (mango), Mmftof ^ct^W 
30 (cassava), Medicago spp. (lucerne), Medicago spp., Metoca spp. (tea tree), Melaleuca undnata 
(broombush), Jtfe»/ta tasmannia, Mentha spicata (spearmint), Mentha X piperita (peppermint), 
Momordica charaniia (bitter melon), Musa spp. (banana), Myrciaria caulijlora (jaboticaba), 
Myrotliamnus flabellifolia, Nepheliunt lappaceum (rambutan), Nerine spp., Ocimum basilicum 
(basil), Oenanthe javanica (water dropwort), Oenothera biennis (evening primrose), Olea 
35 europaea (olive), Olearia sp., Origanum spp. (marjoram, oregano), Oryza spp. (rice), Owfo 
ju&erwa (oca), Oiothamnus spp. (rice flower), Pachyirhizus ahipa (yam bean), Panax spp. 
(ginseng), Panicum miliaceum (common millet), Papaver spp. (poppy), Parthenium argentatum 
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(guayule), Passiflora sp., Paulownia iomemtosa (princess tree), Pelargonium graveolens (rose 
geranium), Pelargonium sp., Pennisetum americanum (bulrush or pearl millet), Persoonia spp,, 
Petroselinum crispum (parsley), Phacelia ianacetifolxa (tansy), Phalaris canadensis (canary grass), 
Phalaris sp., Phaseolus coccineus (scarlet runner bean), Phaseolus lunatus (lima bean), Phaseolus 
spp., Phaseolus vulgaris (culinary bean), Phaseolus vulgaris (navy bean), Phaseolus vulgaris (red 
kidney bean), Pisum sativitm (field pea), Plantago ovata (psyllium), Polygonum minus, Polygonum 
odoratum, Pruuus mwne (Japanese apricot), Psidium guajava (guava), Psophocarpus 
tetragonolobus (winged bean), Pyrus spp. (nashi), Raphamts satulus (long white radish or Daikon), 
Rhagodia spp. (saltbush), Ribes nigrum (black currant), Ricinus communis (castor bean), 
Rosmarinus officinalis (rosemary), Rungia klossii (rungia), Saccharum officinarum (sugar cane), 
Salvia officinalis (sage), Salvia sclarea (clary sage), Salvia sp., Sandersonia sp., Santalum 
acuminatum (sweet quandong), Santalum spp. (sandalwood), Sclerocaiya cqffra (marula), 
Scutellaria galei'iculata (scullcap), Secale cereale (rye), Sesamum indicum (sesame), Setaria 
Uatica (foxtail millet), Simmondsia spp. (jojoba), Solanum spp., Sorghum almutn (sorghum), 
Stachys betonica (wood betony), Stenanthemum scortecheniU Strychnos cocculoides (monkey 
orange), Stylosanthes spp. (stylo), Syz)>gium spp., Tasmannia lanceolate (mountain pepper), 
Tenninalia karnbachii, Theobroma cacao (cocoa), Thymus vulgaris (thyme), Toona australis (red 
cedar), Trifoliium spp. (clovers), Trifolium alexandrinum (berscem clover), Trifolium resupinatum 
(persian clover), Diticum spp., Triticum tauschii, Tylosema esculentwn (morama bean), Valeriana 
Sp. (valerian), Vernonia spp., Vetiver zizanioides (vetiver grass), Vicia benghalensis (purple vetch), 
Vicia faba (faba bean), Vicia narbonensis (narbon bean), Vicia sativa ) Vicia spp., Vigna 
aconitifolia (rnothbean), Vigna angularis (adzulti bean), Vigna mungo (black gram), Vigna radiala 
(mungbean), Vigna spp., Vigna unguiculaia (cowpea), Vitis spp. (grapes), Voandzeia subterranea 
(bambarra groundnut), Triticosecale (triticale), lea mays (bicolour sweetcorn), lea mays (maize), 
Zea mays (sweet corn), Zea mays subsp. mexicana (teosinte), Zieria spp., Zingiber officinale 
(ginger), Zizania spp, (wild rice), Ziziphus jujuba (common jujube). Desirable crops for the 
practice of the present invention include Nicotiana tabacum (tobacco) and horticultural crops such 
as, for example, Ananrn comosus (pineapple), Saccharum spp (sugar cane), Musa spp (banana), 
Lycopcrsicon esadenium (tomato) and Solanum tuberosum (potato). 

[0065] The synthetic constructs of the invention may be introduced directly into an organism 
of interest or into one or more of its parts, e.g., cell or tissue types (e.g., a muscle, skin, brain, lung, 
kidney, pancreas, a reproductive organ such as testes, ovaries and breast, eye, liver, heart, vascular 
cell, root, leaf, flower, stalk or meristem) or into an organ of the organism. Alternatively, the 
synthetic construct is introduced into a progenitor of the organism and the progenitor is then grown 
or cultured for a time and under conditions sufficient to produce the organism of interest, whereby 
the synthetic construct is contained in one or more cell types of that organism. Suitable progenitor 
cells include, but are not limited to, stem cells such as embryonic stem cell, pluripotential immune 
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cells, meristematic cells and embryonic callus. In certain embodiments, the synthetic construct is 
introduced into the organism of interest using a particular route of administration (e.g. for 
mammals, by the oral, parenteral (e.g., intravenous, intramuscular, intraperitoneal, intavcntricular, 
intrarticular), mucosal (e.g., intranasal, intrapulmonary, oral, buccal, sublingual, rectal, 

5 intravaginal), dermal (topical, subcutaneous, transdermal); for plants, administration to flowers, 
meristem, root, leaves or stalk). Practitioners in the art will recognise that the route of 
administration will differ depending on the choice of organism of interest and the sought-after 
phenotype. Desirably, the synthetic constructs are introduced into the same or corresponding site. 
In other embodiments, the synthetic construct is introduced into a cell of the organism of interest 

10 (e.g., autologous cells), or into a cell that is compatible with the organism of interest (e.g., 
syngeneic or allogeneic cells) and the genetically-modified cell so produced is introduced into the 
organism of interest at a selected site or into a part of that organism. 

[0066J The synthetic constructs of the invention may be introduced into an organism of 
interest or part thereof using any suitable method, and the kind of method employed will differ 
15 depending on the intended cell type, part and/or organism of interest. For example, four general 
classes of methods for delivering nucleic acid molecules into cells have been described: (1) 
chemical methods such as calcium phosphate precipitation, polyethylene glycol (PEG)-mediate 
precipitation and lipofection; (2) physical methods such as microinjection, electroporation, 
acceleration methods and vacuum infiltration; (3) vector based methods such as bacterial and viral 
20 vector-mediated transformation; and (4) receptor-mediated. Transformation techniques that fall 
within these and other classes are well known to workers in die art, and new techniques are 
continually becoming known. The particular choice of a transformation technology will be 
determined by its efficiency to transform certain host species as well as the experience and 
preference of the person practising the invention with a particular methodology of choice. It will be 
25 apparent to the skilled person that the particular choice of a transformation system to introduce a 
synthetic construct of the invention into cells is not essential to or a limitation of the invention, 
provided it achieves an acceptable level of nucleic acid transfer. Thus, the synthetic constructs are 
introduced into tissues or host cells by any number of routes, including viral infection, phage 
infection, microinjection, electroporation, or fusion of vesicles, lipofection, infection by 
30 Agrobacterium tumefaciens or A. rhizogenes, at protoplast fusion. Jet injection may also be used 
for intra-muscular administration (as described for example by Furth et al, 1992, Anal Biochem 
205:365-368). The synthetic constructs may be coated onto microprojectiles, and delivered into a 
host cell or into tissue by a particle bombardment device, or "gene gun" (see, for example, Tang et 
al, 1992, Nature 356:152-154). Alternatively, the synthetic constructs can be fed directly to, or 
35 injected into, the host organism or it may be introduced into the cell (i.e., intracellularly) or 
introduced extracellularly into a cavity, interstitial space, into the circulation of an organism, 
introduced orally, etc. Methods for oral introduction include direct mixing of the synthetic 
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constructs with food of the organism. In certain embodiments, a hydrodynamic nucleic acid 
administration protocol is employed (e.g., see Chang et al, 2001, J. Virol. 75:3469-3473; Liu et al, 
199% Gene 77?er. 6:1258-1266; Wolff et al, 1990, Science 247:14654468; Zhang et al, 1999, 
Hum, Gene Tlier. 10:1735-1737; and Zhang et ai, 1999, Gene Tlxer. 7:1344-1349). Other methods 
5 of nucleic acid delivery include, but are not limited to, liposome-mediated transfer, naked DNA 
delivery (direct injection) and receptor-mediated transfer (ligand-DNA complex). 

[0067] In accordance with the present invention, the individual synthetic constructs are 
separately introduced into test organisms which are preferably selected from organisms of the same 
species as the organism of interest or organisms that are related to the organism of interest, or into 

10 test parts of such organisms. Related organisms are generally species within the same phylum, 
preferably species within the same subphylum, more preferably species within superclass, even 
more preferably species within the same class, even more preferably species within the same order 
and still even more preferably species within the same genus. For example, if the organism of 
interest is human, a related species is suitably selected from mouse, cow, dog or cat, which belong 

15 to the same class as human, or a chimpanzee, which belongs to the same order as human. 
Alternatively, if the organism of interest is banana, the related organism may be selected from taro, 
ginger, onions., garlic, pineapple, bromeliaeds, palms, orchids, lilies, irises and the like, which are 
all non-graminaceous monocotyledonous plants and which constitute horticultural or botanical 



20 [0068] After introduction of the synthetic constructs into the test organisms or parts, the 
qualities of their phenotypes are determined by a suitable assay and then compared to determine the 
relative phenotypic preferences of the synonymous codons. The quality is suitably a measure of the 
strength, intensity or grade of the phenotype, or the relative strength, intensity or grade of two or 
more desired phenotypic traits. 

25 [0069] Assays for various phenotypes conferred by the production of a chosen reporter 
protein are known by those of skill in the art. For example, immunity may be assayed by any 
suitable methods that detects an increase in an animal's capacity to respond to foreign or disease- 
specific antigens (e.g., cancer antigens) i.e., those cells primed to attack such antigens are increased 
in number, activity, and ability to detect and destroy the those antigens. Strength of immune 

30 response is measured by standard tests including: direct measurement of peripheral blood 
lymphocytes by means known to the art; natural killer cell cytotoxicity assays (see, e.g., Provinciali 
et al (1992, 1 Immunol Metlu 155: 19-24), cell proliferation assays (see, e.g., Vollenweider and 
Groseurth (1992, J. Immunol Meth. 149: 133-135), immunoassays of immune cells and subsets 
(see, e.g., Loeffier et al (1992, Cytom. 13: 169-174); Rivoltini et al (1992, Can, Immunol. 

35 Immwwther. 34: 241-251); or skin tests for cell-mediated immunity (see, e.g., Chang et al (1993, 
Cancer Res. 53: 1043-1050). Enhanced immune response is also indicated by physical 



relatives. 
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manifestations such as fever and inflammation, as well as healing of systemic and local infections, 
and reduction of symptoms in disease, i.e., decrease in tumour si2e, alleviation of symptoms of a 
disease or condition including, but not restricted to, leprosy, tuberculosis, malaria, naphthous 
ulcers, herpetic and papillomatous warts, gingivitis, atherosclerosis, the concomitants of AIDS 
5 such as Kaposi's sarcoma, bronchial infections, and the like. Such physical manifestations may also 
be used to detect, or define the quality of, the phenotype or class of phenotype displayed by an 
organism. Alternatively, herbicide tolerance may be assayed by treating test organisms (e.g., plants 
such as cotton plants), which express a herbicide tolerance gene (e.g., glyphosate tolerance protein 
gene such as a glyphosate resistant EPSP synthase), with a herbicide (e.g., glyphosate) and 
10 determining the efficacy of herbicide tolerance displayed by the plants. For example, when 
' determining the efficacy of synthetic constructs for conferring herbicide tolerance in cotton, the 
amount of boll retention is a measure of efficacy and is a desirable trait. 

[007O] The qualities of selected phenotype displayed by the test organisms or by the test parts 
are then compared to provide a ranked order of the individual codons, encoding a specific amino 

15 acid, that are tandemly repeated in the synthetic constructs according to their preference of usage 
by the organism or part to confer the selected phenotype. One of ordinary skill in the art will 
thereby be able to determine a "codon preference table" for .each amino acid in the polypeptide 
whose expression conveys the selected phenotype to the organism of interest. Comparison of 
synonymous codons within a codon preference table can then be used to identify codons for 

20 • tailoring a synthetic polynucleotide to modulate the quality of a selected phenotype. 

3. Selection of synonymous and first codons 

(0071J to accordance with the present invention, a comparison of phenotypic preferences can 
be determined for synonymous codons in an organism of interest or in a related organism, or in test 
parts thereof, which can be used as a basis for constructing a synthetic polynucleotide from which a 

25 phenotype-associated polypeptide is producible in the organism of interest or part thereof. The 
synthetic polynucleotide is constructed so that its expression in the organism or part confers a 
selected phenotype upon that organism or part but in a different quality than that conferred by a 
parent polynucleotide that encodes the same polypeptide. The method comprises selecting a first 
codon of the parent polynucleotide for replacement with a synonymous codon, wherein the 

30 synonymous codon is selected on the basis that it exhibits a different phenotypic preference than 
the first codon in a comparison of phenotypic preferences in the organism of interest or in a related 
organism, or in a part thereof, as determined in Section 2. The first codon is then replaced with the 
synonymous codon to construct the synthetic polynucleotide, 

[00721 Thus, in accordance with the present invention, a parent polynucleotide can be 
35 modified with synonymous codons such that quality of the selected phenotype conferred by the 
polynucleotide so modified (synthetic polynucleotide) is higher than from the parent 
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polynucleotide Generally, the difference between the respective phenotypic qualities conferred by 
a synthetic polynucleotide and by a parent polynucleotide depends on the number of first codons 
that are replaced by synonymous codons, and on the difference in phenotypic preference between 
the first codons and the synonymous codons in the organism of interest or part thereof. Put another 
5 way, the fewer such replacements, and/or the smaller the difference in phenotypic preference 
between the synonymous and first codons, the smaller the difference will be in the phenotypic 
quality between the synthetic and parent polynucleotides. Conversely, the more such replacements, 
and/or the greater the difference in phenotypic preference between the synonymous and first 
codons, the greater the difference will be in the phenotypic quality between the synthetic and parent 
10 polynucleotides. 

(0073] In one embodiment, the present invention contemplates a method of constructing a 
synthetic polynucleotide that encodes a phenotype-associated polypeptide and that confers a higher 
quality of a selected phenotype displayed by an organism of interest or part thereof than the quality 
conferred by a parent polynucleotide that codes for the same polypeptide. In this embodiment, a 
15 first codon of the parent polynucleotide is selected for replacement with a synonymous codon, 
wherein the synonymous codon is selected on the basis that it exhibits a higher phenotypic 
preference than the first codon in a comparison of phenotypic preferences in the organism of 
interest or in a related organism, or in a part thereof. 

(00741 In accordance with the present invention, a higher quality of a selected phenotype can 
20 be achieved by selecting a synonymous codon that has a higher phenotypic preference than the first 
codon. Generally, a higher phenotypic preference will correlate with a higher quality of the selected 
phenotype. Thus, in a non-limiting example of such a correlation, a synonymous codon is deemed 
to have at least about a 5% higher phenotypic preference than a first codon when the quality of 
phenotype displayed by an organism or part thereof to which a synthetic construct comprising a 
25 tandem repeat of the synonymous codon has been provided is at least about 5% higher than the 
quality of phenotype displayed by an organism or part thereof to which a synthetic construct 
comprising a tandem repeat of the first codon has been provided. When selecting the synonymous 
codon, it is preferred that it has a phenotypic preference in the organism of interest that is at least 
about 105%, suitably at least about 110%, preferably at least about 120%, more preferably at least 
30 about 1 30%, even more preferably at least about 140%, even more preferably at least about 1 50%, 
even more preferably at least about 160%, even more preferably at least about 170%, even more 
preferably at least about 180%, even more preferably at least about 190%, even more preferably at 
least about 200%, even more preferably at least about 250%, even more preferably at least about 
300%, even more preferably at least about 350%, even more preferably at least about 400%, even 
35 more preferably at least about 450%, even more preferably at least about 500%, even more 
preferably at least about 550%, even more preferably at least about 600%, even more preferably at 
least about 650%, and still even more preferably at least about 700% of the phenotypic preference 
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of the first codon. In the case of two or more synonymous codons having similar phenotypic 
preferences, it will be appreciated that any one of these codons can be used to replace the first 
codon. Generally, if a parent polynucleotide has a choice of low and intermediate phenotypic 
preference codons, it is preferable in the first instance to replace some, or more preferably all, of 

5 the low phenotypic preference codons with synonymous codons having intermediate, or preferably 
high, phenotypic preferences. Typically, replacement of low with intermediate or high phenotypic 
preference codons results in a substantial increase in the quality of the phenotypc conferred by the 
synthetic polynucleotide so constructed. However, it is also preferable to replace some, or 
preferably all, of the intermediate phenotypic preference codons with high translationally efficient 

10 codons for conferring an optimal quality in the selected phenotype. 

[0075) In another embodiment, the present invention contemplates a method of constructing a 
synthetic polynucleotide that encodes a phenotype-associated polypeptide and that confers a lower 
quality of a selected phenotype displayed by an organism of interest or part thereof than the quality- 
confen-ed by a parent polynucleotide that codes for the same polypeptide. In this embodiment, a 
15 first codon of the parent polynucleotide is selected for replacement with a synonymous codon, 
wherein the synonymous codon is selected on the basis that it exhibits a lower phenotypic 
preference than the first codon in a comparison of phenotypic preferences in the organism of 
interest or in a related organism or in a part thereof. 

(0076] In accordance with the present invention, a lower quality of a selected phenotype can 
20 be achieved by selecting a synonymous codon that has a lower phenotypic preference than the first 
codon. A lower phenotypic preference will typically correlate with a lower quality of the selected 
phenotype. Accordingly, in a non-limiting example of such a correlation, a synonymous codon is 
deemed to have at least about a 5% lower phenotypic preference than a first codon when the quality 
of phenotype displayed by an organism or part thereof to which a synthetic construct comprising a 
25 tandem repeat of the synonymous codon has been provided is at least about 5% lower than the 
quality of phenotype displayed by an organism or part thereof to which a synthetic construct 
comprising a tandem repeat of the first codon has been provided- When selecting the synonymous 
codon for this embodiment, it is preferred that it has a phenotypic preference in the organism of 
interest that is at least about 95%, suitably at least about 90%, preferably at least about 85%, more 
30 preferably at least about 80%, even more preferably at least about 75%, even more preferably at 
least about 70%, even more preferably at least about 65%, even more preferably at least about 
60%, even more preferably at least about 55%, even more preferably at least about 50%, even more 
preferably at least about 45%, even more preferably at least about 40%, even more preferably at 
least about 35%, even more preferably at least about 30%, even more preferably at least about 
35 25%, even more preferably at least about 20%, even more preferably at least about 1 S%, even more 
preferably at least about 10%, and still even more preferably at least about 5% of the phenotypic 
preference of the first codon. 
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wi th synonymous codons haw* higher or lower phcnotyp* preference in the organism of mterest 
, oi part thereof than the first codons. For example, a higher or lower phenotypic quah* -can be 
accomplished even with partial replacement Typically, the rep.acement step affects S* 10* 
5 IS*/. 20%, 25%. 30%, more preferably 35%, 40%. 50%. 60%. 70% or more of the first codons of 
the parent polynucleotide, b a preferred embodnnent requiring a h.gher phenotypic quahty. the 
number of, and difference in phenotyp.c preference between, the first codons and the synonymous 
codons are selected such that the phenotype-assoc.ated polypeptide is produced from the synthetic 
polynucleotide to confer a phenotype upon a chosen organism or organism part in a quahty that rs 
10 at least about 110%, suitably at least about 150%, preferably at least about 200%, more preferably 
" at least about 250%, even more preferably at least about 300%. even more preferably at least about 
350% even more preferably at least about 400%, even more preferably at least about 450%, even 
m0 re preferably at least about 500%, and still even more preferably at least about 1000%, of the 
quality of phenotype conferred by the parent polynucleotide in the organism or part. Conversely, m 
15 a preferred embodiment requiring a lower phenotypic quality, the number of, and difference m 
phenotypic preference between, the first codons and the synonymous codons are selected such that 
the phenotype-associated polypeptide is produced from the synthetic polynucleotide to confer a 
phenotype upon a chosen organism or part thereof in a quality that is no more than about 90%. 
suitably no more than about 80%, preferably no more than about 70%, more preferably no more 
20 than about 60%, even more preferably no more than about 50%, even more preferably no more 
than about 40%, even more preferably no more than about 30%, even more preferably no more 
than about 20%, even more preferably no more than about 10%, and still even more preferably no 
more than about 5%, of the quality of phenotype conferred by the parent polynucleotide m the 
organism or part. 

25 4. Construction of synthetic polynucleotides 

10078] Replacement of one codon for another can be achieved using standard methods known 
in the art For example codon modification of a parent polynucleotide can be effected using several 
known mutagenesis techniques including, for example, ohgonucleoude-directed mutagens, 
mutagenesrs with degenerate oligonucleotides, and region-specific mutagenesis. Exemplary ft. «*o 
30 mutagenesis techniques are desenbed for example in U.S. Patent No, 4,184,917. 4.321.365 and 
4 351901 or in the relevant sections of Ausubel. et al. (CURRENT PROTOCOLS IN 
MOLECULAR BIOLOGY, John Wiley & Sons, Inc. 1997) and of Sambrook. et al., 
(MOLECULAR CLONING. A LABORATORY MANUAL, Cold Spring Harbor Press. 1989). 
Instead of in vitro mutagenes lSj the synthetic polynucleotide can be synthesised de novo usmg 
readily available machinery as desenbed, for example, in U.S. Patent No 4.293.652. However, rt 



35 
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should be noted that the present invention is not dependent on, and not directed to, any one 
particular technique for constructing the synthetic polynucleotide. 

(0079) The parent polynucleotide is suitably a natural gene. However, it is possible that the 
parent polynucleot.de that is not naturally-occurring but has been engineered using recombinant 
5 techniques. Parent polynucleotides can be obtained from any suitable source, such as from 
eukaryotic or prokaryotic organisms, including but not limited to mammals or other annuals, and 
pathogenic organisms such as yeasts, bacteria, protozoa and viruses. 

[0080] The invention also contemplates synthetic polynucleotides encoding one or more 
desired portions of the phenotype-associated polypeptide. In this regard, it is preferable that the 
10 synthetic polynucleotide encodes at least one functional domain of the phenotype-assocated 
polypeptide, which is preferably at least about 10, more preferably at least about 20, even more 
preferably at least about 50, even more preferably at least about 100, even more preferably at least 
about 150, and still more preferably at least about 500 contiguous amino acid residues of that 
polypeptide. 

15 [0081] The mvcntion further contemplates a synthetic construct (or expression vector), 
comprising a synthetic polynucleotide of the invention, wmch is operably linked to a regulatory 
polynucleotide. The regulatory polynucleotide suitably comprises transcriptional and/or 
translational control sequences, which will be compatible for expression in the organism of interest 
or in cells of that organism. Typically, the transcnpuonal and translational regulatory control 
20 sequences include, but are not limited to, a promoter sequence, a 5' non-coding region, a c«- 
regulatory region such as a functional binding site for transcriptional regulatory protein or 
translational regulatory protein, an upstream open readmg frame, ribosomal-binding sequences, 
transcnptional start site, translational start site, and/or nucleotide sequence which encodes a leader 
sequence, termination codon, translational stop site and a 3' non-translated region. Constitutive or 
25 inducible promoters as known in the art are contemplated by the invention. The promoters may be 
either naturally occurring promoters, or hybrid promoters that combine elements of more than one 
promoter. Promoter sequences contemplated by the present invention may be native to the 
organism of interest or may be derived from an alternative source, where the region is functional in 
the chosen organism. The choice of promoter will differ depending on the intended host. For 
30 example, promoters which could be used for expression m plants include plant promoters such as: 
constitutive plant promoters examples of which include CaMV35S plant promoter, CaMV19S plant 
promoter, FMV34S plant promoter, sugarcane bacilliform badnavirus plant promoter, CsVMV 
plant promoter, Arabidopsis ACT2/ACT8 actin plant promoter, Arabidopsis ubiquitin OBQ1 plant 
promoter, barley leaf thionin BTH6 plant promoter, and rice actin plant promoter, tissue specific 
35 plant promoters examples of which include bean phaseolin storage protein plant promoter, DLEC 
plant promoter, PHS.beta. plant promoter, zein storage protein plant promoter, conglutin gamma 
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plant promoter from soybean, AT2S1 gene plant promoter, ACT11 actin plant promoter from 
Arabidopsis, napA plant promoter from Bvassica napus and potato patatin gene plant promoter; and 
• inducible plant promoters examples of which mclude a light-inducible plant promoter derived from 
the pea rbcS gene, a plant promoter from the alfalfa rbcS gene, DRE, MYC and MYB plant 
5 promoters which are active in drought; INT, MPS. prxEa, Ha hs P 17.7G4 and RD21 plant 
promoters active in high salinity and osmotic stress, and hsr203J and str246C plant promoters 
active in pathogenic stress. Alternatively, promoters which could be used for expression in 
mammals include the metallothionein promoter, which can be induced in response to heavy metals 
such as cadmium, the 0-actin promoter as well as viral promoters such as the SV40 large T antigen 
10 promoter, human cytomegalovirus (CMV) immediate early (IE) promoter, rous sarcoma virus LTR 
" promoter, adenovirus promoter, or a HPV promoter, particularly the HPV upstream regulatory 
region (URR) may also be used. All these promoters are well described and readily available in the 
art. 

[0082J The synthetic construct of the present invention may also comprise a 3' non-translated 
15 sequence. A 3' non-translated sequence refers to that portion of a gene comprising a DNA segment 
that contains a polyadenylation signal and any other regulatory signals capable of effecting mRNA 
processing or gene expression. The polyadenylation signal is characterised by effecting the addition 
of polyadenylic add tracts to the 3' end of the mRNA precursor. Polyadenylation signals are 
commonly recognised by the presence of homology to the canonical form 5' AATAAA-3' 
20 although variations are not uncommon. The 3' non-translated regulatory DNA sequence preferably 
includes from about 50 to 1,000 nucleotide base pairs and may contain transcriptional and 
translations termination sequences in addition to a polyadenylation signal and any other regulatory 
signals capable of effecting mRNA processing or gene expression. 

100831 In a preferred embodiment, the synthetic construct further contains a selectable marker 
25 gene to permit selection of an organism or a precursor thereof that contains the synthetic construct. 
Selection genes are well known in the art and will be compatible for expression in organism or its 
precursor. 

(00841 It will be understood, however, that expression of protein-encoding polynucleotides in 
heterologous systems is now well known, and the present invenhon is not directed to or dependent 
30 on any particular vector, transcriptional control sequence or technique for its production. Rather, 
synthetic polynucleotides prepared according to the methods set forth herein may be introduced 
into an organism of interest or into a precursor or progenitor thereof in any suitable manner in 
conjunction with any suitable synthetic construct or vector, and the synthetic polynucleotides may 
be expressed with known promoters in any conventional manner. 

35 [00851 In some embodiments, the synthetic constructs of the invention are in the form of viral 

vectors, such as simian virus 40 (SV40) or bovine papilloma virus (BPV), which has the abmty to . 
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replicate as extra-chromosomal elements (Eukaryotic Viral Vectors, Cold Spring Harbor 
Laboratory, Gluzman ed., 1982; Sarver et al, 1981, Mol. Cell Biol. 1:486). Viral vectors include 
retroviral (lentivirus), adeno-associated virus (see, e.g., Okada, 1996, Gene Ther. 3:957-964; 
Muzyczka, 1994, J. Clin. Invst. 94:1351; U.S. Pat. Nos. 6,156,303; 6,143,548 5,952,221, 
5 describing AAV vectors; see also U.S. Pat. Nos. 6,004,799; 5,833.993), adenovirus (see, e.g., U.S. 
Pat. Nos. 6,140,087; 6,136,594; 6,133,028; 6,120,764), reovirus, herpesvirus, rotavirus genomes 
etc., modified for introducing and directing expression of a polynucleotide or transgene in cells. 
Retroviral vectors can include those based upon murine leukemia virus (see, e.g., U.S. Pat. No. 
6,132,731), gibbon ape leukemia virus (see, e.g., U.S. Pat. No. 6,033,905), simian immuno- 
10 deficiency virus, human immuno-deficiency virus (see, e.g., U.S. Pat. No. 5,985,641), and 
combinations thereof. 

[0086] Vectors also include those that efficiently deliver genes to animal cells in vivo (e.g., 
stem cells) (see, e.g., U.S. Pat. Nos. 5,821,235 and 5,786,340; Croyle et al., 1998, Gene Tlier. 
S:645; Croyle et al., 1998, Pharm. Res. 15:1348; Croyle et al, 1998, Hum. Gene 77>er. 9:561; 

15 Foreman et al., 1998, Hum. Gene Tlxer. 9:1313 ; Wirtz et al., 1999, Gut 44:800). Adenoviral and 
adeno-associated viral vectors suitable for in vivo delivery are described, for example, in U.S. Pat. 
Nos. 5,700,470, 5,731,172 and 5,604,090. Additional vectors suitable for in vivo delivery include 
herpes simplex virus vectors (see, e.g., U.S. Pat. No. 5,501,979), retroviral vectors (see, e.g., U.S. 
Pat. Nos. 5,624,820, 5,693,508 and 5,674,703; and WO92/05266 and W092/14829), bovine 

20 papilloma vims (BPV) vectors (see, e.g., U.S. Pat. No. 5,719,054), CMV-bascd vectors (see, e.g., 
U.S. Pat. No. 5,561,063) and parvovirus, rotavirus and Norwalk virus vectors. Lentiviral vectors 
are useful for infecting dividing as well as non-dividing cells (see, e.g., U.S. Pat. No. 6,013,5 16). 

(0087] Vectors for insect cell expression commonly use recombinant variations of 
baculoviruses and other nucleopolyhedrovirus, e.g., Bombyx mori nucleopolyhedrovinis vectors 

25 (see, e.g., Choi, 2000, Arch. Virol. 145:171-177). For example, Lepidopteran and Coleopteran cells 
are used to replicate baculoviruses to promote expression of foreign genes carried by baculoviruses, 
e.g., Spodoptera frugiperda cells are infected with recombinant Autographa califomica nuclear 
polyhedrosis viruses (AcNPV) carrying a heterologous, e.g., a human, coding sequence (see, e.g., 
Lee, 2000, J. Virol 74:11873-11880; Wu, 2000, J. Biotechnol. 80:75-83). See, e.g., U.S. Pat. No. 

30 6,143,565, describing use of the polydnavirus of the parasitic wasp Glyptapanteles indiensis to 
stably integrate nucleic acid into the genome of Lepidopteran and Coleopteran insect cell lines. See 
also, U.S. Pat. Nos. 6,130,074; 5,858,353; 5,004,687. 

[0088] Expression vectors capable of expressing proteins in plants are well known in the art, 
and include, e.g., vectors from Agrobacterium spp., potato virus X (see, e.g., Angell, 1997, £MBO 
35 J. 16:3675-3684), tobacco mosaic virus (see, e.g., Casper, 1996, Gene 173:69-73), tomato bushy 
stunt virus (see, e.g., Hillman, 1989, Virology 169:42-50), tobacco etch virus (sec, e.g„ Dolja, 
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1997, Virology 234:243-252), bean golden mosaic virus (see, e.g., Morinaga, 1993, Microbiol 
Immunol. 37:471-476), cauliflower mosaic virus (see, e.g., Cecchini, 1997, Mol Plant Microbe 
Interact 10:1094-1 101), maize Ac/Ds transposable element (see, e.g., Rubin, 1997, Mol Cell Biol 
17:6294-6302; Kunze, 1996, Cwr. Top. Microbiol Immunol 204:16M94), and the maize 
5 suppressor-mutator (Spm) transposable element (see, e.g., Schlappi, 1996, Plant Mol Biol 32:717- 
725); and derivatives thereof. 

10O89J The invention further contemplates organisms containing therein the synthetic 
construct of the invention, or alternatively, parts, precursors, cells or tissues produced by the 
methods of the invention. 

10 [0090J In order that the invention may be readily understood and put into practical effect, 
particular preferred embodiments will now be described by way of the following non-limiting 
examples. 
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EXAMPLES 

EXAMPLE 1 

Codon-modif.cation of an antigen-encoding pdynnc.eotide to enhance the antibody response 

of a mammal to that antigen 

5 Reaeents: 

l00 91] For each set of synonymous codons that code for a single amino acid, a set of 2*6 gene 
constructs is constructed, each encoding green fluorescent protein with a leader sequence 
comprising six copies of an anuno acid. Each member of the set is .dentical except that, across the 
" set, the leader sequence comprises 6 copies of each of the possible synonymous codon tnplets 
1 0 (XXX) that code for the amino acid. 

[0092] The general format of these gene constructs would be: 
]0093] Eukaryotic Promoter - ATG - (XXX)6 - GFP gene - polyadenylation signal 
10094] the nucleic add and deduced amino acids sequences of the GFP gene are set forth in 
SEQ ID NO: 1 and 2, respectively. The nucleic acid and deduced amino acids sequences of the 
15 different leader sequences are set forth in SEQ ID NO: 3 to 120. 

t 0095] A superset of 20 such sets of gene constructs is thus constructed, encoding all possible 
naturally occurring amino acids for which there are redundant coding triplets, comprising in total 
59 gene constructs. 

Method: 

20 100961 Groups of 8-10 6 wk old Balb/c female mice are immunised transcutaneous^ using a 
biolistic delivery system (e.g., the Biorad gene gun or the Powderject delivery system) with one of 
the 59 gene constructs, on one to three occasions at three weekly mtervals. Plasmid (1-2 ^g) are 
coated onto the gold beads (Biorad) or sugar (Powderject) material according to the manufacturers 
instructions. Beads are delivered to the shaven abdominal skin of the mouse. The Powderject 

25 system is optmused for delivery to the basal epidermis. The Biorad system has the capacity to alter 
delivery depth by adjusting the gas pressure used for the delivery. Data are collected for gas 
pressures optimised for delivery to the superficial, or basal epithelium, or to the dermts. This ,s 
calibrated for the particular gene gun and beads by delivenng beads to skin and examinmg 
microscopic sections to score the percentage of beads at various depths in the dermis and 
30 epidermis. 

l0 097] Serum samples are taken by retro-orbital bleeding at weeks 3 and 6 post immumsation 
and by terminal cardiac puncture at week 9 post immunisation and the titre of antibody to GFP 
determined by ELISA for each serum. For ELISA, GFP (Sigma) is coated onto plates and serum 
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reactivity detennined according to standard protocols (see, e.g., Walsh et al, 2000, J Gen Urol. 
709-718). 

Outcome and interpretation: 
[0O98J The mean antibody titre to GPF induced by each set of constructs encoding a given 
amino acid is ranked to determine the optimal codon for antibody production using a 
polynucleotide delivered by a biolistic delivery system (conversely the optimal codon to avoid 
antibody production). 

(00991 A list of optimal codons is thus be produced. An improved gene construct for a 
polynucleotide vaccine delivered by the tested delivery system to skin and designed to induce 
maximal antibody titre is one in which codons in the gene of interest (e.g., influenza 
hemagglutinin) which are not Optimal are (in large measure) replaced by ones that are optimal for 
producing maximal antibody titre. 

EXAMPL E2 

Codon-modification of an antigen-encoding polynucleotide to confer tolerance to that antigen 
15 in a mammal 

Rea gents: 

[01 00] For eaeh set of synonymous codons that code for a single amino acid, a set of 2-6 gene 
constructs is constructed, each encoding glutamic acid decarboxylase (GAD) with a leader 
sequence comprising six copies of an amino acid. Each member of the set is identical except that, 
20 across the set, the leader sequence comprises 6 copies of each of the possible synonymous codon 
triplets (XXX) that code for the amino acid. 
[0101] The general format of such gene constructs would be: 
[0102J Eukaryotic Promoter - ATG - (XXX)6 - GAD gene - polyadenylation signal 
[0103] The nucleic acid and deduced amino acids sequences of the GAD gene are set forth in 
25 SEQ ID NO: 12 1 and 1 22, respectively. 

(0104J A superset of 20 such sets of gene constructs is thus constructed, encoding all possible 
naturally occurring amino acids for which there are redundant coding triplets, comprising in total 
59 gene constructs. 

Methods: 

30 10105] Groups of 4 week old NOD (non obese diabetic) female mice are immunised 

transcutaneously using a biolistic delivery system (e.g., the Biorad gene gun or the Powderject 

delivery system) with one of the 59 gene constructs, on one to three occasions at three weekly 

intervals. Plasrnid (1-2 jag) are coated onto the gold beads (Biorad) or sugar (Powderject) material 

- 30 - 

PAGE 38/1 12 * RCVD AT 5/5/2005 3:24:30 AM [Eastern Daylight Time] * SVR:DCS-CS01/1 * DNIS:2047 * CSID: * DURATION (mrn-ss):3346 



05-05-' 05 17:35 FROM- T-330 P039/112 F-313 

# 

WO20MMMM PCT/AU2(M>J/0OI487 
according to the manufacturers instructions. Beads are delivered to the shaven abdominal skin of 
the mouse. The Powderject system is optimised for delivery to the basal epidermis. The Biorad 
system has the capacity to alter delivery depth by adjusting the gas pressure used for the delivery. 
Data are collected for gas pressures optimised for delivery to the superficial, or basal epithelium, or 
5 to the dermis. This is calibrated for the particular gene gun and beads by delivering beads to skin 
and examining microscopic sections to score the percentage of beads at various depths in the 
dermis and epidermis. 

[0106] The mice arc followed to the onset of diabetes as assessed by a blood sugar level over 
10 rnM/t on weekly blood samples obtained by retro-orbital bleeding. Onset of diabetes is 
10 confirmed by testing for glycosuria, and pancreatic histology determined on the diabetic mice, and 
on mice deemed free of diabetes at 26 weeks. 

Outcome and i nterpretation: 
[0107] The mean time to onset of diabetes for mice immunised with each set of constructs 
encoding a given amino acid would be ranked, to determine the optimal codon for expressing the 
15 gene of interest in the sites responsible for prevention of a pathogenic immune response, using a 
polynucleotide delivered by a biolistic delivery system. 

(0108] A list of optimal codons for ensuring protein expression from a polynucleotide vaccine 
at the optimal sites to induce host protective immunity (tolerance) is thus produced. An improved 
gene construct for a polynucleotide vaccine delivered by the tested delivery system to skin and 
20 designed to induce maximal host protective immune tolerance is one in which codons in the gene 
of interest (e.g., insulin with a view to controlling human diabetes) which are not optimal are (in 
large measure) replaced by ones that are optimal. 

EXAMPLE 3 

Codoo-modification for optimising the resistance of transgenic plants to herbicides while 
25 maintaining seed protein production. 

Reagents: 

[0109] For each set of synonymous codons that code for a single amino acid, a set of 2-6 gene 
constructs is constructed, each encoding phosphinothricin acetyltransferase (encoded by the bar 
gene) with a leader sequence comprising six copies of an amino acid. Each member of the set is 
30 identical except that, across the set, the leader sequence comprises 6 copies of each of the possible 
synonymous codon triplets (XXX) that code for the amino acid. 

[0110] The general format of such gene constructs would be: 

[01 1 1] Plant Promoter ATG - (XXX)6 - BAR gene - plant polyadenylation signal 
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[01121 The nucleic acid and deduced amino acids sequences of the BAR gene arc set forth in 
SEQIDNO: 123 and 124, respectively. 

[0113] A superset of 20 such sets of gene constructs is thus constructed, encoding all possible 
naturally occurring amino acids, comprising in total 59 gene constructs. 

5 Method: 

[0114] The method of Weeks et al (1993, Plant Physiol 102(4): 1077-1084) for the 
production of multiple independent lines of fertile transgenic wheat (Triticum aestivum). Briefly, 
calli derived from immature embryos, 0.5 to 1 mm long, are bombarded with microprojectiles 
coated with DNA containing as marker genes the bar gene, encoding phosphinothricin-resistance, 

10 and the gene encoding p-glucuronidase (GUS), each under control of a maize ubiquihn promoter. 
The bombardment is performed 5 d after embryo excision, just after initiation of callus 
proliferation. The ability of plantlets to root in the presence of 1 or 3 mg/L of bialaphos is a reliable 
selection criteria used to identify transformed plants. Stable transformation is confirmed by marker 
gene expression assays and the presence of the bar sequences in high molecular weight 

15 chromosomal DNA of the resultant plants. On average, 1 68 d elapses between embryo excision for 
bombardment and anthesis of the TO plants. The transmission of both the resistance phenotype and 
bar DNA to the Tl generation verified that gerrnline transformation had occurred. 

[0115] Herbicide resistance is measured using the method of Singh and Wright (2002, Lett 
Appl Microbiol. 35(1): 12-1 6) and protein concentration in seed is measured by the method of 
20 Turner et al (1990, Plant MoL Biol 14(5):793-803). 

Outcome and interpretation: 
(01161 For each amino acid leader sequence, plants are ranked according to their resistance to 
the Basta Herbicide, and to their seed protein content. The optimal codon from each set is the one 
which gives plants adequate herbicide resistance and the maximum seed protein yield, 

25 [01 17 J A list of optimal codons is thus be produced, and an improved gene construct for a 
gene conveying herbicide resistance to a transgenic grain plant is one in which codons in the gene 
of interest (say those in the broad host range plasmid pJP4, which carries genes for the degradation 
of 2,4-dicUorophenoxyacetic acid (2>D), 2-methyM-chlorophenoxyacetic acid, and 3- 
chlorobenzoic acid (see e.g., Kleinsteuber, et al 7 2001, £xtre?nophiles. 5(6):375-384) which are not 

30 optimal are (in large measure ) replaced by ones that are optimal. 
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EXAMPLE 4 

Codon-modifiCatioD for optimising the selectivity of a growth inhibitory gene for controlling 
a skin tumour without damage to surrounding normal skin. 

Reaperits: 

5 (01 1 8] For each set of synonymous codons that code for a single amino acid, a set of 2^6 gene 
constructs is constructed, each encoding an E2F1 dominant negative inhibitor with a leader 
sequence comprising six copies of an amino acid. Each member of the set is identical except that, 
across the set, the leader sequence comprises 6 copies of each of the possible synonymous codon 
triplets (XXX) that code for the amino acid. 
10 (01 i?J The general format of such gene constructs would be: 

[0120J Eukaryotic Promoter - ATG - (XXX)6 - E2Fldn gene - IRES - GFP gene- 
polyadenylation signal 

[01211 The nucleic acid and deduced amino acids sequences of the E2F1 dominant negative 
inhibitor gene are set forth in SEQ ID NO: 125 and 126, respectively. 

1 5 (0122) A superset of 20 such sets of gene constructs is thus constructed, encoding all possible 
naturally occurring amino acids for which there are redundant coding triplets, comprising in total 
59 gene constructs. 

Methods; 

(0123) Tumours are induced on the skin of FVB mice by painting with chemical carcinogens 
20 using a standard DMBA/TPA promotion/induction regime. Groups of tumour bearing female mice 

are treated topically using a biohstic delivery system (e.g., the Biorad gene gun or the Powderject 
delivery system) with one of the 59 gene constructs, on one to three occasions at three weekly 
intervals. Plasmid (1-2 fig) are coated onto the gold beads (Biorad) or sugar (Powderject) material 
according to the manufacturers instructions. Beads are delivered to the shaven abdominal skin of 
25 the mouse. The Powderject system is optimised for delivery to the basal epidermis. The Biorad 
system has the capacity to alter delivery depth by adjusting the gas pressure used for the delivery. 
Data are collected for gas pressures optimised for delivery to the superficial, or basal epithelium, or 
to the dermis. This is calibrated for the particular gene gun and beads by delivering beads to skin 
and examining microscopic sections to score the percentage of beads at various depths in the 
30 dermis and epidermis. 

(0124) Twenty-four hours after treatment, the skin from the tumour and surrounding normal 
skin is excised and disaggregated into single cells. Cells are sorted for expression of the GFP by 
high speed flow cytometry, and the clonogenic potential of the GFP positive cells assessed for each 
construct, and for normal and tumour tissue. Clonogenic potential is established by plating cells on 
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collagen coated plates in minimal KGM medium, and by counting colonies established per 10000 
input cells. The ratio of colonies obtained from normal tissue to colonies obtained from tumour 
tissue is measure of the efficiency of targeting of that particular construct to the tumour tissue - 
higher colony forming efficiency for the normal skin cells in comparison to the tumour cells 
5 indicates a more preferred codon for correctly delivering gene therapy to the tumour. 

Outcome and interpretation; 

10125) The mean colony forming efficiency ratio (normal/tumour cells) are ranked to 
determine the optimal codon for inhibiting tumour growth while preserving normal skin, using a 
polynucleotide delivered by a biolistic delivery system. A list of optimal codons is thus produced. 

10 An improved gene construct for treating skin cancers delivered by the tested delivery system to 
skin and designed to induce maximal selectivity for the tumour over normal tissue is one in which 
codons in the gene of interest for skin cancer treatment (for example p53) most resemble the 
optimal list. 

10126) The disclosure of every patent, patent application, and publication cited herein is 
15 hereby incorporated herein by reference in its entirety. 

(0127] The citation of any reference herein should not be construed as an admission that such 
reference is available as "Prior Art" to the instant application. 

[0128] Throughout the specification the aim has been to describe the preferred embodiments 
of the invention without limiting the invention to any one embodiment or specific collection of 
20 features. Those of skill in the art will therefore appreciate that, in light of the instant disclosure, 
various modifications and changes can be made in the particular embodiments exemplified without 
departing from the scope of the present invention. AH such modifications and changes are intended 
to be included within the scope of the appended claims. 



-34- 



PAGE 42/1 12 ^ RCVD AT 5/5/2005 3:24:30 AM [Eastern Daylight Time] * SVR:DCS-CS01/1 * DNIS:2047 * CSID: * DURATION (mm-ss):3346 



05-05-' 05 17:37 FROM- 



T-330 P043/112 F-313 



# 



WO HUMMUS* PCI7AU2.HU/0OJ4R7 

WHAT IS CLAIMED IS: 

1 . A method for constructing a synthetic polynucleotide from which a polypeptide 
is producible to confer a selected phenotype upon an organism of interest or part thereof in a 
different quality than that conferred by a parent polynucleotide that encodes M same 
polypeptide, the method comprising: (a) selecting a first codon of the parent polynucleotide for 
replacement with a synonymous codon, wherein the synonymous codon is selection the basis 
that it exhibits a different phenotypic preference than the first codon in a/comparison of 
phenotypic preferences in test organisms or parts thereof, wherein the t&t organisms are 
selected from the group consisting of organisms of the same species as tlWorganism of interest 
and organisms that are related to the organism of interest; and (b) replacing the first codon with 
the synonymous codon to construct the synthetic polynucleotide. 

2. A method according to claim 1, wherein the phenotypic preferences of codons 
in the test organisms or parts are compared by: (i) separately introducing into the test organisms 
or parts individual synthetic constructs, each of which comprises a regulatory polynucleotide 
operably linked to a tandem repeat of a codon fused in fram/with a reporter polynucleotide that 
encodes a reporter protein, which produces, or which is/predicted to produce, a corresponding 
phenotype selected from the group consisting of the selected phenotype and a phenotype of the 
same class as the selected phenotype; and (ii) ytomparing the quality of the phenotypes 
displayed by the test organisms or parts to detemfine the relative phenotypic preferences of the 
codons. 

3. A method according to c\/im 1, wherein the reporter protein produces the 
selected phenotype. 

4. A method according l/claim 1, wherein the reporter protein does not produce 
the selected phenotype but produces/ttie same class of phenotype as the selected phenotype. 

5. A method according to claim 3 or claim 4, wherein the reporter protein is 
selected from antigens derived from pathogenic organisms, cancer antigens, self antigens, 
transplantation antigens, growth factors, hormones and toxins. 

6. A metho/ according to claim 1, wherein the phenotype is selected from 
immunity, antigen toterance, angiogenesis, anti-angiogenesis, amelioration of clinical 
symptoms, reduced J increased cell death, reduced or increased cell differentiation, reduced or 
increased cell proJHeration, tumour or cancer regression, growth and repair of tissue or organ, 
decreased fibroins, inhibition or reversal of cell senescence, increased or reduced cell migration, 
differential depression of protein between different cells or tissues of an organism or part 
thereof, traW recovery, recovery from burns, antibiotic resistance or sensitivity, herbicide 
tolerance or sensitivity, starch biosynthesis or modification, fatty acid biosynthesis , disease 
resistance or tolerance, pest resistance or tolerance including insect resistance or tolerance, viral 
rejfetance or tolerance, fungal resistance or tolerance, a metabolic trait including sucrose 
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metabolism, frost resistance or tolerance, stress tolerance, and unproved food content/* 
increased yields. 

7 A method according to claim 1, wherein the phenotype is an immune response. 

8. A method according to claim 7, wherein the immune response is/humoral 

immune response. / 

9. A method according to claim 7, wherein the immune response is^ cell m< dialed 

immune response. 

10. A method according to claim 1, wherein the immune response is an innate 

immunity mediated response. 

U. A method according to claim 1, wherein the synthetic/constructs are introduced 

into the test organisms using the same or similar mode of inttoductym. 

12. A method according to claim 1, wherein the syndetic constructs are introduced 
into the test organisms at the same or corresponding site . 

13. A method according to claim 1. wherein tb/organism of interest is a mammal 
and the synthetic constructs are introduced by oral, i/travenous, intramuscular, intranasal, 
buccal, subcutaneous, transdermal, buccal or sublingualroute. 

14. A method according to claim 1, wh/ein the synthetic constructs are introduced 
into one or more of cell or tissue types of the organism of interest. 

15 A method according to claim 14, wherein the synthetic constructs are 
introduced into cells selected from muscle cells, skm cells, brain cells, lung cells, kidney cells, 
pancreas cells, cells of a reproductive afan. heart cells, vascular cells, liver cells, eye cells, 
flower cells, meristeroatic cells, root cells and leaf cells. 

16. A method according to claim 1, wherein the tandem repeat of each of the 
synthetic constructs comprises at/ast three copies of the corresponding codon. 

17. A method according to claim 1, wherein the synonymous codon is selected such 
that it has a higher phenotyn/ preference than the first codon. 

18. A metho/according to claim 17, wherein the synonymous codon is selected 
when the quality of th/phenotype conferred by the synthetic construct comprising a tandem 
repeat of the synonymous codon is at least about 5% higher than the quality of the phenotype 
conferred by the s/thetic construct comprising a tandem repeat of the first codon. 

19. Amethod according to claim 1, wherein the synonymous codon is selected such 
that it has a Jpwer phenotypic preference than the first codon. 

A method according to claim 19, wherein the synonymous codon is selected 
when ^quality of the phenotype conferred by the synthetic construct comprising a tandem 
rep/of the synonymous codon is at least about 5% lower than the quality of the phenotype 
conferred by the synthetic construct comprising a tandem repeat of the first codon. 

21. A method according to claim 1. wherein the organism of interest is unicellular. 
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22. A method according to claim 1, wherein the organism of interest is a 

multicellular organism. 

23. A method according to claim 22, wherein the multicellular organism is an 



animal. 

24. 
mammal. 
25. 
26. 



A method according to claim 22, wherein the multicellular organism is a 



A method according to claim 22, wherein the multicellular organism is a plant. 
A method according to claim \ y wherein the synthetic Constructs are introduced 
into progenitors of the test organisms or parts and the progenitors are grown or cultured for a 
time and under conditions sufficient to produce the test orgarufems or parts, whereby the 
synthetic constructs are contained in one or more cell types of tho/e organisms or parts. 

27. A method according to claim 26, wherein the tfrogenitor cells are selected from 
stem cells, pluripotential cells, meristematic cells and embryonic callus. 

28. A method for determining the phenotypi/ preference of a first codon in an 
organism of interest or part thereof, the method comprising: (a) introducing a synthetic 
construct into a test organism or part thereof, whercm the test organism is selected from the 
group consisting of an organism of the same species is the organism of interest and an organism 
that is related to the organism of interest, the Aithetic construct comprising a regulatory 
polynucleotide operably linked to a tandem repeat of the first codon fused in frame with a 
reporter polynucleotide that encodes a reporter^rotein, which produces, or which is predicted to 
produce, a selected phenotype or a phenotytfe of the same class as the selected phenotype; and 
(b) determining the quality of the corresponding phenotype displayed by the organism or part. 

29. A method according to ^laim 28, further comprising: comparing (i) the quality 
of the corresponding phenotype displayed by a test organism or part thereof to which a synthetic 
construct comprising a tandem repeat of the first codon was provided; and (ii) the quality of the 
corresponding phenotype display^ by a test organism or part thereof to which a synthetic 
construct comprising a tandem/repeat of a second codon was provided, wherein the second 
codon encodes the same ami/o acid as the first codon, to thereby determine the phenotypic 
preference of the first codgft relative to the phenotypic preference of the second codon in the 
test organism or part. 

method according to claim 28, further comprising: (1) introducing the 
a progenitor of a test organism or part thereof; and (2) producing the 
from the progenitor, wherein the test organism or part contains the 



30. A 
synthetic construct j: 
test organism or 



synthetic const 

31. A method according to claim 28, further comprising: (1) introducing the 
synthetic construct into a progenitor of a test organism or part thereof; and (2) growing the test 
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organism or pan from the progenitor, wherein Ihe test onanism or part comprises e,ceil 

containing the synthetic construct. 

32. A method according to claim 28, further comprising: introducing th/synthetic 

construct into a selected cell of the test organism or part. 

33 . A synthetic polynucleotide constructed according to claim 1 or/aim 28. 

34. An organism of interest or part thereof containing a syndic polynucleotide 
constructed according to claim 1 or claim 28. 

35. An organism of interest or part thereof containing / synthetic construct that 
comprises a regulatory polynucleotide operably linked to a tandem/peat of a first codon fused 
in fi^me with a reporter polynucleotide that encodes a reporter protein, which produces, or 
which is predicted to produce, a selected phenotypc or a pj&iotype of the same class as the 
selected phenotype in the organism or part. 

36. A method of modulating the quality of/ elected phenotype that is displayed by 
an organism of interest or part thereof and that/results from the expression of a parent 
polynucleotide that encodes the polypeptide, t)/method comprising: introducing into the 
organism or part a synthetic polynucleotide th/is distinguished from the parent polynucleotide 
by the replacement of a first codon in the t>4nt polynucleotide with a synonymous codon that 
exhibits a different phenotypic prefereno/than the first codon in the organism or part. 

37. A method of enhancing the quality of a selected phenotype that is displayed by 
an organism of interest or part/toereof and that results from the expression of a parent 
polynucleotide that encodes tffe polypeptide, the method comprising: introducing into the 
organism or part a synthetic/olynucleotide lhat is distinguished from the parent polynucleotide 
by the replacement of a first codon in the parent polynucleotide with a synonymous codon that 
exhibits a higher pheiyfypic preference than the first codon in the organism or part. 

38. A dtethod of reducing the quality of a selected phenotype that is displayed by 
an organism o^inteTest or part thereof and that results from the expression of a parent 
polynucleotid/ that encodes the polypeptide, the method comprising: introducing into the 
organism c/part a synthetic polynucleotide that is distinguished from the parent polynucleotide 
by the emplacement of a first codon in the parent polynucleotide with a synonymous codon that 
exhibits a lower phenotypic preference than die first codon in the organism or part. 
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cllO> The University of Queensland (all states, except U.S.) 
Frazer, Ian Hector (U.S. only) 

<120> A method for optimising gene expression 
<130> 12178192/VPA 

<14 0> Unassigned 
<141> 2003-11-10 

<150> USSKf 60/425,163 
<151> 2002-11-08 

<160> 126 

<170> Patentin version 3.2 

<210> 1 

<211> 714 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Humanised GFP 



*220> 

<22l> CDS 

<222> (1) - - (711) 



aqc aag ggc gag gaa ctg ttc act ggc gtg gtc cca att etc gtg gaa 
Ser Lys G?y Glu Slu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu 
j. 5 10 

r ^ rr^t- ata aat qqq cac aaa ttt tct gtc age gga gag ggt 
2 C Z £ S £ Sy His Ly» Ph. ser v.! S« «y 3!u 0!y 
20 25 

gaa ggt gat gec aca tac gga aag etc -ace ctg aaa ttc ate tge acc 
Glu 3y Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr 
35 40 45 . 

„<- „„ n aaa etc cct gtg cca tgg cca aca ctg gtc act ace ttc tct 
Thr Sy Lys 2u So ^ Pro Tr P Pro Thr Leu Val Thr Thr Phe Ser 
50 55 60 

rat . aa _ ata C aq tgc ttt tec aga tac cca gac cat atg aag cag cat 
G?y va! Gin cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin Hi. 

65 

n ^ ttt ttc aag age gee atg ccc gag ggc tat gtg cag gag aga ace 
Zl Phe Phe Lys Ser La Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
85 90 2? 

^ i-t-f t-tc aaa qat gac ggg aac tac aag acc cgc get gaa gtc aag 
X Phe Phe LyJ fsp Asp S mb Tyr Lys Thr Arg Ala Glu Val Lys 

ttc gaa ggt gac acc ctg gtg aat aga ate gag ctg aag ggc att gac 

• 1 - 
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Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 
115 120 i" 5 



10 



Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 
20 25 3 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr 
35 40 45 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu val Thr Thr Phe ser 
50 55 60 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His MeC Lys Gin His 



432 



480 



526 



ttt aag gag gat gga aac att etc ggc cac aag ctg gaa tac aac tat 
Phe Lys Glu Asp Gly Asn He Leu Gly Has Lys Leu Glu Tyr Asn ryr 
130 135 140 

aac tec cac aat gtg tac ate atg gec gac aag caa aag aat ggc ate 
A6 n ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie 
145 150 I 55 

aag gtc aac ttc aag ate aga cac aac att gag gat gga tec gtg cag 
Lys III Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser Val Gin 
165 l™ 175 

' ctg gec gac cat tat caa cag aac act cca ate ggc gac ggc cct gtg 57 6 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val 
180 185 190 

etc etc cca gac aac cat tac ctg tec ace cag tct gec ctg tct aaa 624 
III HI Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
195 200 205 

gat cec aac gaa aag aga gac cac atg gtc ctg ctg gag ttt gtg ace 
Lp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
210 215 220 

get get ggg ate aca cat ggc atg gac gag ctg tac aag tga 
Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 2 

<211> 237 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Humanised GFP 

<400> 2 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 



672 



714 



-2- 
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Asp Phe Phe Lys Ser Ala Met Pro Gltt Gly Tyr Val Gin oltt Arg Thr 

90 " 



85 



tie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
100 105 110 

Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp 
115 120 I 25 

Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr 
130 135 "0 

Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly lie 



Asn 

145 150 



155 I 60 



Lys val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val Gin 
165 170 175 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 
180 185 190 

Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
195 200 205 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
210 215 220 



Ala Ala Gly He Thr HisTciy Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 3 

<211> 18 

<212> DNA 

*213> Artificial Sequence 
<220> 

<223> Ala (GCA) x6 



<220> 

*221> CDS 
*222> (1)-.(18) 

*400> 3 

gca gca gca gca gca gca 
Ala Ala Ala Ala Ala Ala 
1 5 



<210> 4 
<211> 6 
<212> PRT 
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<213> Artificial Sequence 
<220> 

*223>* Ala(GCA)x6 
<400> 4 

Ala Ala Ala Ala Ala Ala 



*210> S 

<211> 18 

<2l2> DNA 

<213> Artificial Sequence 
<220> 

<223> Ala (GCG) X.6 



<220> 

<221> CDS 
<c222> (1) • . (18) 

<400> 5 18 
gcg gcg gcg gcg gcg gcg 
Ala Ala Ala Ala Ala Ala 
1 5 



<210> 6 

<2H> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Ala (GCG) x6 

<400> 6 

Ala Ala Ala Ala Ala Ala 
1 5 



<210> 1 

<211> IB 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ala(GCT)x6 



<220? 

<221> CDS 
<222> 11).. (18) 

<400> 7 18 
get get get get get get 
Ala Ala Ala Ala Ala Ala 



- 4 - 
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<2io> e 

<21l> 6 
<2X2> 

. 213> Artificial Sequence 



<220> 

<223> Ala(GCT)x6 
<400> 8 

Ala Ala Ma Ala Ala Ala 



<210> . 9 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ala{GCC)x6 



T-330 P056/112 F-313 




PCT/AU2003/0014H7 



<220> 

<22X> CDS 
<222> (1) (IB) 

<400> 9 

gcc gcc gcc gcc gcc gcc 
Ala Ala Ala Ala Ala Ala 



<210> 10 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ala (GCC) x6 

<400> 10 

Ala Ala Ala Ala Ala Ala 



<210> 11 

<:211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Arg(AGA)x6 



<220> 

<221> CDS 

<222> d).-(ie) 

<400> 11 

-5 
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aga aga aga aga aga aga 
Arg Arg Arg Arg Arg Arg 



<210> 12 

<21X> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Arg (AGA) X6 

<400> 12 

" Arg Arg Arg Arg Arg Arg 
1 S 



<210> 13 

<:2H> IB 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Arg (CGA) X6 



T-330 P057/112 F-313 

PCT/AU20O3/OO1487 
18 



<220> 

<:221> CDS 
<222> (1) . 



(18) 



<400> 13 

cga cga cga cga cga cga 
Arg Arg Arg Arg Arg Arg 
1 5 



18 



<210> 14 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Arg (CGA) X6 

<400> 14 

Arg Arg Arg Arg Arg Arg 
1 5 



<210> 15 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
*220:> 

<223> Arg(CGG)x6 



<220> 
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<221> CDS 
<222> (D--UB) 



<400> 15 

egg egg egg egg egg egg 
Arg Arg Arg Arg Arg Arg 

1 5 



18 



<210> 



*2H> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Arg(CGG)x6 

<400> 16 

Arg Arg Arg Arg Arg Arg 
1 5 



<210> 17 

«c211> 18 

<21Z> DNA 

<213> Artificial Sequence 
<220> 

<223> Arg (CGT) x6 



<220> 

<221> CDS 
<222> (1) - - (18) 

<400> 17 18 
cgt cgt cgt cgt cgt cgt 
Arg Arg Arg Arg Arg Arg 



<210> 18 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Arg (CGT) X6 

<400> 18 

Arg Arg Arg Arg Arg Arg 
1 5 



<210> 19 

<211> IB 

<212> DNA 

<213> Artificial sequence 
<220> 
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<223> 



Arg (AGG) x6 



<220> 
<t221> 
<222> 



CDS 

(1) . • (18) 



<40O> 19 

agg agg agg agg agg agg 
Arg Arg Arg Arg Arg Arg 



18 



<210> 20 

,. <211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Arg (AGG) x6 

<400> 20 

Arg Arg Arg Arg Arg Arg 



<210> 21 
<2X1> 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Arg (CGC) x6 



<220> 

<221> CDS 
<222> (1)..(18) 

*400> 21 18 
cgc cgc cgc cgc cgc cgc 
Arg Arg Arg Arg Arg Arg 
1 5 



<210> 22 

*211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Arg (CGC) X6 

<400> 22 

Arg Arg Arg Arg Arg Arg 
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<212> DNA 

*213> Artificial Sequence 
<220> 

<223> Asn (AAC) x6 



<220> 

<221> CDS 
<222> (l)-.(lB) 

<400> 23 ie 
aac aac aac aac aac aac 
Asn Asn Asn Asn Asn Asn 



<210> 24 

<2ll:> 6 

*212> PRT 

<:213> Artificial Sequence 
<220> 

<223> ASH (AAC) X6 

<:400> 24 

Asn Asn Asn Asn Asn Asn 
1 5 



<210> 25 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Asn(AAT)x6 



<220> 

<221> CDS 
<222> (1)..{18) 

<400> 25 18 
aat aat aat aat aat aat 
Asn Asn Asn Asn Asn Asn 



<210> 26 

<211> 6 

<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> Asn (AAT) x6 

<400> 26 

Asn Asn Asn Asn Asn Asn 
1 5 



1 



5 



1 



5 
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<210> 27 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Asp (GAT) x6 



<220> 

<221> COS 
<222> (1).-(18) 

<400> 27 

gat gat gat gat gat gat 
Asp Asp Asp Asp Asp Asp 
1 5 



<210> 28 

<211> 6 

<212> PfcT 

<213> Artificial Sequence 
<22Q> 

<223> Asp (GAT) *6 

<400> 28 

ASP Asp Asp Asp Asp Asp 
1 5 



<210> 29 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Asp (GAC) X6 



<220> 

<221> CDS 
<222> (1) . - (18) 

<400> 29 

gac gac gac gac gac gac 
Asp Asp Asp Asp Asp Asp 
1 5 



<210> 30 

«c211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Asp (GAC) x6 



- 10- 
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<400> 30 

Asp Asp Asp Asp Asp Asp 



<210> 31 

<:211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cys(TGC)x6 



<220> 

<221> CDS 
<222> (1)^(18) 

<400> 31 XS 
tgc tgc tgc tgc tgc tgc 
Cys cys cys Cys Cys Cys 
l 5 



<210> 32 
<211> 6 
c212> PRT 

<213> Artificial Sequence 
<220> 

<223> Cys (TGC) x€ 
<400> 32 

Cys Cys Cys Cys Cys Cys 
1 * 5 



<210> 33 

<211> IB 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cys(TGT)x6 



<220> 

<221> CDS 
<222> (1)..(18) 

<40O> 33 18 
tgt tgt tgt tgt tgt tgt 
Cys Cys Cys Cys Cys Cys 
1 5 



<210> 34 

<211> 6 

<212> PRT 

<213> Artificial Sequence 



- 11 - 
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<220> 

<223> Cys(TGT)x6 



<400> 34 



cys Cys Cys cys cys eye 
1 s 



<210> 
<211> 
<212> 
<2l3> 



35 
18 
DNA 



Artificial sequence 



<220> 

<223> Gln(CAA)x6 



<220> 
<221> CPS 
<222> (l)--(lB) 

<400> 35 18 
caa caa caa caa caa caa 
Gin Gin Gin Gin Gin Gin 
1 * 



<210> 36 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Gin (CAA) x6 

<4O0> 36 

Gin Gin Gin Gin Gin Gin 
1 5 



<210> 37 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Gln(CAG)x6 



<220> 

<221> CDS 
<222> (D..(1B> 

<AO0> 37 18 
cag cag cag cag cag cag 
Gin Gin Gin Gin Gin Gin 



1 



5 
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<210> 38 

<2il> 6 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Gln(CAG)x6 
<400> 38 

Gin Gin Gin Gin Gin Gin 
1 5 



<210> 39 

<211> 18 

<212> DNA 

<212> Artificial Sequence 
<220> 

<223> Glu(GAA)x6 



<220> 

<221> CDS 
<222> (1).,{18) 

<400> 39 !8 
gaa gaa gaa gaa gaa gaa 
Glu Glu Glu GlU Glu Glu 
1 5 



<210> 40 

<211> 6 

<212* PRT 

<213> Artificial Sequence 
<220> 

<223> G1U(GAA)X6 

<400> 40 

GlU GlU Glu GlU GlU GlU 
1 5 



<210> 41 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GlU (GAG) X6 



*220> 

c221> CDS 

<222> (D..(18) 

<400> 41 18 

gag gag gag gag gag gag 
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GlU Glu Glu Gl\l Glu Glu 
1 S 



<210> 42 

<211> 6 

*212> PRT 

<213> Artificial Sequence 
<220> 

<223> Glu{GAGjX6 

<400> 42 

Glu Glu GlU Glu Glu Glu 
1 5 



T-330 P065/112 F-313 




PCT/AU2003/0014K7 



<210> 43 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Gly (GGA) X6 



<220> 

<221> CDS 
<222> <1)..<1B) 

<400> 43 

gga gga gga gga gga gga 
Gly Gly Gly Gly Gly Gly 
1 5 



<210> 44 

<211> 6 

<212> *>fcT 

<213> Artificial Sequence 
<220> 

<223* Gly (GGA) x6 

<400> 44 

Gly Gly Gly Gly Gly Gly 
1 5 



<210> 


45 


<211> 


18 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Gly(GGG)x6 


<220> 




*22l> 


CDS 



- 14- 
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05-05-' 05 17:43 FROM- 



T-330 P066/112 F-313 




WO 2MM/042059 



PCT/AU200J/0014K7 



<222> (l)..(l8) 



<400> 45 



18 



ggg ggg 999 ggg s$9 9B9 
Gly Gly Gly Gly Gly Gly 



<210> 46 

<211> 6 

<2l2> PRT 

<213> Artificial Sequence 
<220> 

<223> Gly {GGG) x6 

<400> 46 

Gly Gly Gly Gly Gly Gly 
1 5 



<210> 47 

*211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Gly (GGC) X6 



<220> 

<22l> CDS 
<222> (1)..(18) 

<400> 47 

ggc ggc ggc ggc 99c ggc 
Gly Gly Gly Gly Gly Gly 
l 5 



<210> 48 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Gly (GGC) x6 

<400> 48 

Gly Gly Gly Gly Gly Gly 
1 5 



<210> 49 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
*220> 

<223> Gly(GGT)x6 
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1 



5 



-15- 



05-05-' 05 17:43 FROM- T-330 P067/112 F-313 

• • 

PCT/AU2©03/0014*7 

WO 2004/042059 



<220> 

<221> CDS 
<222> (1) • ■ ("> 

<400> 49 

ggt ggt ggt ggt ggt ggt 
Gly Gly Gly Gly Gly Gly 



<210> SO 

<2X1> 6 

<212> PRT 

<213> Artificial Sequence 
«c220> 

<223> Gly<GGT)x6 

<400> 50 

Gly dy Giy g1 y G1 y Gly 

1 S 



<210> 51 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> His(CAC)x6 



<220> 
<22\> CDS 



<222> (1) . - U8> 

<400> 51 18 
cac cac cac cac cac cac 
His His His His His His 
1 5 



<210> 52 

<211> 6 

*212> PET 

<213> Artificial Sequence 



<220? 

<:223> His (CAC) *6 
<400> 52 

His His His His His His 
1 5 



<210> 53' 

<2H> I 8 
<2\2> DtfA 

-16- 
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, , 05-05-' 05 17:43 FROM- 

# 

WO 2004/042059 



<213> Artificial Sequence 
<220> 

<222> His (CAT) X6 



<220> 
<22l> CDS 
<222> (1) (18) 

<4O0> 53 

cat cat cat cat cat cat 
His His His His His His 
1 5 



<210> 54 

<:2H> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> His (CAT) x6 

<4Q0> 54 

His His His His His His 

1 5 



«c210> 55 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220* 

<223> Ile(ATC)x6 



<220> 
<221> CDS 
<222> (1)..(18) 

<400> 55 

ate ate ate ate ate ate 
lie He He lie He He 
1 5 



T-330 P068/112 F-313 




PCT/AU2003/U014B7 



18 



18 



<210> 


56 


<211> 


6 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Ile(ATC)x6 


<400> 


56 



He lie He He He He 
1 5 
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l' I 



05-05-' 05 17:43 FBOM- 



T-330 P069/112 F-313 





WO 2004/042059 



PCT/AU2003/00I487 



<210> 5? 
<211> 18 
<212> DWA 

<2l3> Artificial sequence 
<220> 

<223> . Ile(ATT)x6 



<220> 

<221> CDS 
<222> (1)..'{18) 

<400> 57 

act att att act att att 
He He He He He He 
1 5 



<210> 58 

<2H> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> He (ATT) x6 

<400> 58 

He He He He He He 
1 s 



<210> S9 

<211> 18 

<212> DNTA 

<213> Artificial sequence 
<220> 

<223> lie (ATA) x6 



<220> 

<221> CDS 
<222> (1)--(18) 

<400> 59 

ata ata ata ata ata ata 
He He He He He He 
1 5 



<210> 60 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> lie (ATA) x6 

^400> 60 
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-18- 



05-05-' 05 17:43 FROM- 



T-330 P070/112 F-313 




wo umiiunm 



PCT/AU200 JAM) 1487 



He He He He He He 
1 5 



<210> 61 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

*223:> Leu(CTC)x6 



<220> 

<221> CDS 
<222> (1)..(18) 

<400> 61 

etc etc etc etc etc etc 18 
Leu Leu Leu Leu Leu Leu 
1 5 



<210> 62 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Leu<CTC)x6 



<210> 63 

<211> 18 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Leu(TTG)x6 



<220> 
<221> CDS 
<222> (1)..(18) 

<400> 63 

ttg teg ttg ttg ttg ttg 18 

Leu Leu Leu Leu Leu Leu 

15 



<210> 64 

<211> 6 

<212> PRT 

<213> Artificial Sequence 



<400> 



62 



Leu Leu Leu Leu Leu Leu 
1 5 



- 19 - 
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05-05-'05 17:44 FROM- 



T-330 P071/112 F-313 





WO 2004/042059 



PCT/AU2003/OOJ487 



<220> 

<223> Leu(TTG)x6 



<4O0> 64 



Leu Leu Leu Leu Leu Leu 
X 5 



<210> 65 

<211> 18 

<212> DNA 

<213> Artificial Sequence 

<220> 

" <223> Leu(C?A)x6 



<220> 

<:221> CDS 
<222> (1)..(18) 

<400> 65 18 
eta eta eta eta eta eta 
Leu Leu Leu Leu Leu Leu 
1 5 



<210> 66 

<211> 6 

<212> PUT 

<212> Artificial Sequence 
<220> 

*223> Leu(CTA)x6 

<400> 66 

Leu Leu Leu Leu Leu Leu 
1 5 



<210> 67 

<211> IB 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Leu(CTG)x6 



<22Q> 

<22l> CDS 
<222> (1>..{1B> 

<400> 67 18 
ctg ctg ctg ctg ctg ctg 
Leu Leu Leu Leu Leu Leu 



<210> 68 

-20- 
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05-05-' 05 17:44 FROM- 



T-330 P072/112 F-313 



PCT/AU2O03/00H87 



WO 2004/042055) 



^211> 6 
<212> PRT 

^13> Artificial Sequence 



<220> 

<223> Leu(CTG)x6 
<:400> 6B 

Leu Leu Leu Leu Leu Leu 



<210> 69 

<2ll> 18 

<2l2> DNA 

<213> Artificial sequence 
<220> 

<223> Leu(TTA)X<S 



<220> 

<22l> CDS 
<222> (1) (18) 

<400> 69 

tea tta tta tta tta tta 
Leu Leu Leu Leu Leu Leu 



<210> 70 

<211> 6 

c212> PRT 

<213> Artificial Sequence 
<220> 

<223> Leu (TTA) x6 

<4O0> 70 

Leu Leu Leu Leu Leu Leu 



<210> 71 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Leu(CTT)x6 



<220> 
<221> CDS 
<222> (1) • • (18) 

<400> 71 

Ctt Ctt ctt Ctt ctt Ctt 
Leu Leu Leu Leu Leu Leu 



-21 - 
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05-05-' 05 17:44 FHOM- 



T-330 P073/112 F-313 





WO 2004/042050 



PCT/AU2003/001487 



1 



5 



<210> 72 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Leu(CTT)x6 

<4O0> 72 

Leu Leu Leu Leu Leu Leu 
1 5 



<21Q> 73 

<211> 18 

<212> DtfA 

<213> Artificial Sequence 
<220> 

<223> Lys (AAG) x6 



<220> 

<22l> CDS 
<222> (1) . . (18) 

<400> 73 

aag aag aag aag aag aag 
Lys Lys Lys Lys Lys Lys 
1 5 



<210> 74 

<211> 6 

<2i2> mr 

<213> Artificial Sequence 
<220> 

<223> Lys (AAG) X6 

<400> 74 

Lys Lys Lys Lys lys Lys 
1 5 



<210> 75 

<2U> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Lys (AAA) x6 



<220> 
<221> CPS 
<222> (1)..(18) 



-22- 
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r x 



05-05-' 05 17:44 FROM- 



T-330 P074/112 F-313 



WO 2004/04205*) 



PCT/AU2003/001487 



<400> 75 

aaa aaa aaa aaa aaa aaa 
Lys Lys Lys Lys Lys Lys 



<210> 76 

<211> 6 

<2X2z> PRT 

<213> Artificial Sequence 
<220> 

<223> Lys (AAA)x6 

<400> 76 

Lys Lys Lys Lys Lye Lys 
1 5 



<210> 77 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Phe(TTT)x6 



<220> 

<221> CDS 
<222> U)..<18) 

<400> 77 

ttt ttt ttt ttt ttt ttt 
Phe Phe Phe Phe Phe Phe 
1 5 



<210> 78 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Phe (TTT) x6 

<400> 78 

Phe Phe Phe Phe Phe Phe 
1 5 



<210> 79 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Phe(TTC)x€ 



1 



5 



- 23 - 

PAGE 74/112 ^ RCVD AT 5/5/2005 3:24:30 AM [Eastern Daylight Time] * SVR:DCS-CS01/1 * DNIS:2047 * CSID: * DURATION (mm-ss):3346 



05-05-' 05 17:44 FROM- 



T-330 P075/112 F-313 




WO 2<I04MU2U5!> 



PCT7AU2003/<lOUtf7 



<220> 

<221> CDS 
<222> (1) • - (18) 



<400> 79 

ttc ttc ttc ttc ttc ttc 
Phe Phe Phe Phe Phe Phe 



18 



<210> 
<211> 
<212> 
<213> 



80 
6 

PRT 

Artificial Sequence 



<220> 
<223> 



Phe (TTC) X6 



<400> 



80 



Phe Phe Phe Phe Phe Phe 
1 5 



<210> 81 

<211> 18 

<212> DNA 

<213> Artificial sequence 

<:220> 

<223> Pro(CCC)x6 



<220> 

<221> CDS 
<222> (1) . . (18) 

<400> 81 X8 
ccc ccc ccc ccc ccc ccc 
Pro Pro Pro Pro Pro Pro 

1.5 



<210> 82 

c211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Pro (CCC) x6 

<400> 82 

Pro Pro Pro Pro Pro Pro 



<210> 83 

<211> 18 

<212> DWA 

<213> Artificial sequence 
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-24- 



* , 05-05-' 05 17:44 FROM- 



T-330 P076/112 F-313 



♦ 



WO 2004/Q4205!) 



PCT/AU200J/0014H7 



<220> 

*223> Pro(CCT)x6 



<220> 
<221> CDS 
<222>. {1)..U8) 

<400> 83 18 
cct cct cct cct cct cct 
Pro Pro Pro Pro Pro Pro 
1 * 



" <210> 84 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Pro (CCT) X6 



<210> 85 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Pro(CCG)x6 



<220> 
<221> CDS 
<:222> (1)..(18) 

<400> 85 IB 
ccg ccg ccg ccg ccg ccg 
Pro Pro Pro Pro Pro Pro 
1 s 



<210> 86 

<c211:> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Pro (CCG) X6 

<400> 86 

Pro Pro Pro Pro Pro Pro 



«c400> 



84 



Pro Pro Pro Pro Pro Pro 
1 5 



-25- 
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05-05-' 05 17:45 FROM- 



T-330 P077/112 F-313 





WO 2004/04205!) 



PCT/AU200J/0OU87 



<210> 



87 



<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Pro (CCA) X6 



<220> 

<221> CDS 
<222> (1) . . (18) 

<400> 87 

cca cca cca cca cca cca 
Pro Pro Pro Pro Pro Pro 
1 5 

<210> 88 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Pro (CCA) X6 

<400> 88 

Pro Pro Pro Pro Pro Pro 
1 5 



<210> 89 

<211> 18 

<212> DNA 

«c213> Artificial Sequence 
<220> 

<223> Ser(AGC)x6 



<220> 

<221> CDS 
<222> (D..U8) 

<400> 89 

age age age age age age 
Ser Ser Ser Ser Ser Ser 
1 5 



<210> 90 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ser(AGC)x6 

<400> 90 



-26 - 
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* . 05-05-' 05 17:45 FROM- 



# 



WO 2004/042059 



Ser Ser Ser Ser Ser Ser 
1 5 



<:210> 91 

<211> IB 

<212> DNA 
<213> 



Artificial sequence 



<=220> 

<223> Ser (TCT) x6 



<220> 
<221> 
<222> 



CDS 

(1) . . (18) 



<400> 91 

tct tct tct tct tct tec 
Ser Ser Ser Ser Ser Ser 
1 5 



T-330 P078/112 F-313 



PCT/AU2003/0014B7 



18 



<210> 92 

<211=> 6 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Ser (TCT) x6 
<400> 92 

Ser Ser Ser Ser Ser Ser 
1 5 



<210> 93 

<211> 18 

<212=» DNA 

<213> Artificial Sequence 
<220> 

<223> Ser(AGT)x6 



<220> 

<221> CDS 
<222> (1) ..(18) 

<400> 93 IB 
agt agt agt agt agt agt 
Ser Ser Ser Ser Ser Ser 



<210> 94 

<211> 6 

<212> PRT 

<:213> Artificial Sequence 



<220> 

-27 - 
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* > 



05-05-' 05 17:45 FROM- 



T-330 P079/112 F-313 



# 




WO 2004/042059 



PCT/AU2003/OO14«7 



<223> Ser(AGT)x6 



<4Q0> 94 



Ser Ser Ser Ser Ser Ser 



<210> 95 

<211> 18 

<:212 > DNA 

<213> Artificial Sequence 
<:22Q> 

<223> Ser(TCG)x6 



<220> 

<221> CDS 
<222> (1) US) 

<:400> 95 18 
teg teg teg teg teg teg 
ser Ser Ser Ser Ser Ser 
1 5 



<210> 96 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ser(TCG)x6 

<400> 96 

Ser Ser Ser Ser Ser Ser 



<210> 97 

<211> 18 

<212> DNA 

<213* Artificial Sequence 
<220> 

<223> Ser(TCA)x€ 



<220> 

<221> CDS 
<222> ID * ■ <1 B) 

<400> 97 

tea tea tea tea tea tea 
Ser Ser ser ser Ser Ser 



<210> 98 
<2ll> « 
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i 



5 
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* • 05-05-' 05 17:45 FROM- 
WO 2004AM2059 



<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Ser(TCA)x6 
<400> 98 

ser ser ser ser Ser Ser 
1 5 



<210> 99 

<2ll> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ser(TCC)x6 



<220> 

<c221> CDS 
<222> (1) . - (18) 

<400> 99 

tCC tec tec tec tec tec 

Ser Ser Ser Ser Ser ser 

1 * 



T-330 P080/112 F-313 




PCT/AU20Q3/001487 



18 



<210> 


100 


<211> 


6 


<212> 


PRT 


<212> 


Artif icial 


<220> 




<223> 


ser (tcc) xe 


<400> 


100 



Ser Ser Ser Ser Ser Ser 



<210> 101 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Thr (ACA) x6 



<220> 

<221> CDS 
c222> (1)..(1B) 

«400> 101 18 
aca aca aca aca aca aca 
Thr Thr Thr Thr Thr Thr 
l 5 

-29- 
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05-05-' 05 17:45 FROM- 



T-330 P081/112 F-313 




\ 




WO 2WU/042059 



PCT/AU200 J/001487 



<210> 102 

<2ll> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Thr{ACA)x6 

<400> 102 

Thr Thr Thr Thr Thr Thr 
1 5 



<210> 103 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Thr (ACG) X6 



<220> 

<221> CDS 
*222> (1)..(18) 

<:400> 103 

acg acg acg acg acg acg 
Thr Thr Thr Thr Thr Thr 



<210> 104 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Thr (ACG) x6 

<400> 104 

Thr Thr Thr Thr Thr Thr 
1 5 



<210> 105 

<211> 18 

<212> mh 

<213> Artificial Sequence 

. <220> 

c223> Thr (ACT) x6 



<220> 

<221> CDS 
<222> (l)..(l8) 



1 



5 



-30- 
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t . 05-05-' 05 17:45 FROM- 



T-330 P082/112 F-313 





WO 2004/042059 



PCT/AU2003/001487 



<400>- 105 



18 



act act act act act act 
Thr Thr Thr Thr Thr Thr 



<210> 106 
<211> 6 
<2l2> .. PRT 

<213> Artificial Sequence 
<220> 

<223> Thr (ACT) x6 
<c400> 106 

Thr Thr Thr Thr Thr Thr 
1 5 



<210> 107 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Thr (ACC) X6 



<220> 

<221> CDS 

<222> (1). .(18) 

<400> 107 



Thr Thr Thr Thr Thr Thr 



«£210> 108 

<211> € 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Thr (ACC) x6 

<400> 108 

Thr Thr Thr Thr Thr Thr 
1 5 



<210> 109 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Tyr(TAC)x6 



-31 - 
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i 



5 



acc acc acc acc acc acc 



1 



5 



, 05-05-' 05 17:46 FROM- 
WO 2004/04205!) 



<220> 

<221> CDS 
<222> (D . . (18) 

<400> 109 

tac tac tac tac tac tac 
Tyr Tyr Tyr Tyr Tyr Tyr 
1 5 



<210> 110 

<211> 6 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Tyr (TAC) x6 
<400> 110 

Tyr Tyr Tyr Tyr Tyr Tyr 
l S 



«c210> 111 

<211> 18 

<212> DUA 

<213> Artificial sequence 



<220> 

<223> Tyr (TAT) x6 



<220> 

<221> CDS 
<222> (1)-.(18) 

<400> 111 

tat tat tat tat tat tat 
Tyr Tyr Tyr Tyr Tyr Tyr 
1 * 



T-330 P083/112 F-313 




PCT/AU20O3/OO1487 



IB 



18 



<:210?> 112 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Tyr (TAT) x6 

<4 00> 112 

Tyr Tyr Tyr Tyr Tyr Tyr 
1 5 



<210> 113 
<2H> 18 
<212> DWA 

<213> Artificial Sequence 

. 32 - 
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WO 2004/04205!) 



PCT/AU2003/00M87 



<220> 
<223> 



Val (GTG) X6 



<:220> 
<221> 
<222> 



CDS 

(1) . - (IB) 



<400> H3 

gtg gtg gtg gtg gtg gtg 
Val Val Val Val Val Val 
1 5 



18 



<210> 114 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Val (GTG) *6 

<400> 114 

Val val Val Val Val Val 
1 5 



<210> 115 

<2H> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Val (GTT) x6 



<220> 

<221> CDS 
«222> (1)..(18) 

<400> 115 18 
gtt gtt gtt gtt gtt gtt 
Val Val Val Val Val Val 



<210> 116 

<211> 6 

*212> PRT 

<213> Artificial Sequence 
<220> 

<223> Val (GTT) X6 

<400> 116 

Val Val Val val Val val 



<210> H7 
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WO 2004/042059 



PCT/AU2OO3/00I4H7 



<211> 



18 



<212> DNA 

<213> Artificial sequence 



<220> 

<223> Val (GTC) x6 



<220> 

<221> CDS 
<222> (1).. (18) 

<400> 117 18 
gtc gtc gtc gtc gtc gtc 
Val Val Val Val Val Val 
1 5 



<210> 118 

c211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

«c223:> Val (GTC) x6 

<400> 118 

Val Val Val Val Val Val 
1 5 



<210> 119 

<211> 18 

<212> DtfA 

<213> Artificial Sequence 
<220> 

<223> Val (GTA) x6 



<220> 

<221> CDS 
<222> (1)^(18) 

<400> 119 

gta gta gta gta gta gta. 
Val Val Val Val Val Val 



<210> 120 

<211> 6 

<212> PRT 

<213> Artificial Sequence 



<22Q> 

<22 3> Val (GTA) x6 
<400> 120 

Val Val Val Val Val Val 
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<210> 121 

<:311> 2583 

<212> DNA 

<213> Mouse 



<220> 

<221> CDS 

<222> (1)..(2166) 



ga^ctt'cgg gac gag caa act ccg ggc cac agg aag aac cca teg aac 46 
G?u Leu Arg Lp 111 Gin Thr Pro Gly His Arg Lys Asn Pro Ser Asn 
1 5 10 

caa age age tta gaa tot gac toe aat tac ceo tec att tec act toe 
Gin ser Ser Leu Glu Ser Asp ser Asn Tyr Pro Ser lie ser Thr Ser 
20 25 30 

gaa ate gga gac act gag gat gec ctt cag cag gtg gag gag att ggc 
Glu lie Gly Asp Thr Glu Asp Ala Leu Gin Gin Val Glu Glu He Gly 
35 40 45 

ata gag aag gca gec atg gac atg acc gtc ttc ctg aag ctg cag aag 
5* l!u Lys Ala Ala Met Asp Met Thr Val Phe Leu Lys Leu Gin Lys 

55 



50 



60 



aga gtg cgc gaa ctt gag cag gag agg aag aag ctg cag gcg cag eta 
Arg Val Arg Glu Leu Glu Gin Glu Arg Lys Lys Leu Gin Ala Gin Leu 

oaa aaq gga cag cag gac age aag aaa ggg cag gta gaa caa cag aac 
Til Lys Sly Gin Gil Asp Ser Lys Lys Gly Gin Val Glu Gin Gin Asn 
85 90 95 

aat ggc tta gat gtg gac cag gac gca gat ata gee tac aat agt ctg 
Asn Gly Leu Asp Val Asp Gin Asp Ala Asp lie Ala Tyr Asn Ser Leu 
100 105 110 

aaa aga cag gag ctt gag tea gag aac aag aag ctg aag aat gac ctg 
lys Arg Gin Glu Leu Glu Ser Glu Asn Lys Lys Leu Lys Asn Asp Leu 
U5 120 !2S 

aat aaq ctg agg aac ggt gtc get gac caa gee atg cag gat aac tec 
Asn IS Leu Arg Asn lly Val Ala Asp Gin Ala Met Gin Asp Asn Ser 
130 135 140 

acc cac age tec cca gac age tac age etc eta ctg aac cag etc aag 
Thr His ser ser Pro Asp Ser Tyr Ser Leu Leu Leu Asn Gin Leu Lys 
146 150 155 

ctg gec aat gag gag etc gag gtc cgc aaa gag gag gcg ctg ate etc 
Leu Ala Asn Glu Glu Leu Glu Val Arg Lys Glu Glu Ala Leu lie Leu 
165 170 175 



agg acc cag ate atg aat gec gac cag cgc cgc ctg tct ggc aag aac 
Arq Thr Gin He Met Asn Ala Asp Gin Arg Arg Leu Ser Gly Lys Asn 
180 185 190 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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atg gag ccg aac ate aat gec aga aca age tgg ccc aac agt gag aag 
Met Glu Pro Asn He Asn Ala Arg Thr Ser Trp Pro Asn Ser Glu Lys 
195 200 205 

cac gtg gac cag gaa gac gec att gag gec tat cac ggg gtc tgc cag 
His Val Asp Gin Glu Asp Ala lie Glu Ala Tyr His Gly Val Cys Gin 



210 



215 220 



aca aac agg ttg ccg gag gec cag ctg cag gec cag age ctg gag cat 
Thr Asn Arg Leu Leu Glu Ala Gin Leu Gin Ala Gin Ser Leu Glu His 



225 230 235 240 

gag gag gag gtg gaa cat etc aag gec cag gtg gaa gec ctg aaa gag 
Glu Glu Glu Val Glu His Leu Lys Ala Gin Val Glu Ala Leu Lys Glu 
245 250 255 



aat gag agg aag ctg aag aag cag ctg aag att tac atg aag aag gtc 
Asn Glu Arg Lys Leu Lys Lys Gin Leu Lys He Tyr Met Lys Lys Val 

310 315 320 



305 



tgt ctg cct gca tac ata etc tat atg tgc ate agg cac gcg gat tac 
Cys Leu Pro Ala Tyr He Leu Tyr Met Cys He Arg His Ala Asp Tyr 

390 395 400 



624 



672 



720 



768 



816 



864 



gag atg gac aaa cag cag cag acc ttc tgc cag acc ctg ctg etc tec 
Glu Met Asp Lys Gin Gin Gin Thr Phe Cys Gin Thr Leu Leu Leu Ser 
260 265 270 

cca gag gec cag gta gaa ttt ggt gtc cag cag gag ata tec egg ctg 
Pro Glu Ala Gin Val Glu Phe Gly Val Gin Gin Glu He Ser Arg Leu 
275 280 285 

acc aat gag aac ctg gat ttt aag gaa ttg gtg gaa aag ctg gag aag 912 
Thr Asn Glu Asn Leu Asp Phe Lys Glu Leu Val Glu Lys Leu Glu Lys 
290 29S 300 



960 



1008 



cag gac tta gaa get gec cag gcg ttg gca cag agt gac agg agg cac 
Gin Asp Leu Glu Ala Ala Gin Ala Leu Ala Gin Ser Asp Arg Arg His 
325 330 335 

cat gaa etc aca aga cag gtc aca gtc caa cga aaa gag aag gac ttc 1056 
His Glu Leu Thr Arg Gin Val Thr Val Gin Arg Lys Glu Lys Asp Phe 
340 345 350 

caa ggc atg ctg gag tac cac aaa gag gtc gaa gec etc etc ate egg 1104 
Gin Gly Met Leu Glu Tyr His Lys Glu Val Glu Ala Leu Leu He Arg 
355 360 365 

aac ctg gtg aca gac ctg aag cct cag atg ctg ctg ggc acc gtg ccc 1152 
Asn Leu Val Thr Asp Leu Lys Pro Gin Met Leu LeU Gly Thr Val Pro 
370 375 380 



1200 



385 

acc aac gat gac etc aag gtg cac teg ttg ctg age tec acc ate aac 124 8 

Thr Asn Asp Asp Leu Lys Val His Ser LeU Leu Ser Ser Thr He Asn 

405 410 415 

ggc att aag aaa gtc etc aag aag cac aat gac gac ttt gag atg acg 1296 

Gly He Lys Lys Val Leu Lys Lys His Asn Asp Asp Phe Glu Met Thr 
420 42S 430 

tea ttc tgg tta tec aac acc tgc cgc ttc ctt cac tgt ctg aag caa 1344 

Ser Phe Trp Leu Ser Asn Thr Cys Arg Phe Leu His Cys Leu Lys Gin 

-36- 
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445 



435 440 



s s z £ a » s a £ a = £ a s a £ 

450 455 

**-r nsr etc act gaa tac cgc cag gta eta 
gag cac tgt etc aag aac ttt gac etc act g 

S2 His Cys Leu Lye Asn Phe Asp Leu Thr Glu Tyr Arg ^ 



465 



1392 



1440 



^ tat caa caq etc att aaa atg ccc gag 

S K £ £ £ 2 £ £ 2 f; - - £ 01u 

485 49U 

s = a a £ s £ « s k = = = a BS 

500 505 

S S S £ 2 £ S3 5S £ £ - 1 - S £ 

515 520 

2s s a £ ss a =s £ e s s e =3 £ 

530 535 

- a i-r ate ctq cag gtg ttc aaa cag etc ttc 
cag ggc ctg gac ccc gag att ate ctg cag g g ^ ^ phfi 

Gin Gly Leu Asp Pro Glu lie lie Leu bin v / ^ 

545 550 5 " 

,- ntc act ctt aac aac eta etc ctg egg aaa gac 

£ 2 S £ E S3 £ £ « « - - « »» ss Asp 

565 570 

ace t 9 c tec tgg age aea ggc atg caa etc agg tac aac ata age caa 
fla Cys ser t? sir Thr Gly Met Gin Leu Arg Tyr Asn lie Ser 
580 585 

a ss s 5 £-3 s £ = £ = = 5 85 e 

595 600 
«. ,cc « S M ccc ct 9 ■«. c. 9 K 9C. c.9 etc etc «. ccg £ 
Gin Thr Met Glu pro Leu He Gin Ala Ala em jeu 

610 61S 
aag aaa acc cac gag gat get gag gee ate tgc tct ctg tge ace tee 
Lys Lys Thr His Glu Asp Ala Glu Ala lie cys ser y ^ 

625 630 

s s £ a a s« £ £ £ £ = £ £ £ = 

645 550 

= s £ a a sss £ a = £ £ £ £ £ = 

660 66S 

s s £ = a a = 5 £ = = £ 5 = 5 ° s 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



196B 
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# 



W Q 2004/042059 PCT/AU2003/001487 



s = a s s e s £ = ss = = = s = = 



690 695 

^ r-ac ate cca qcc tgt cCC aac ccg gag ttC etc aat 

atg gac teg ate cac ate ccg gec g ^ ^ ph£ Leu Asn 

Met ASP Ser lie His lie pro w , 20 

705 "0 

gaa gte tgaggatgcg tgtttcegag gegagegaga aggaagcatg tgctgtcagc 
Glu Val 



2112 



2160 



2216 



227 6 



cgagagaatg ctaggtgtgt taaatattec agegtagate aaaecatgtt agagaetggc 
gggacgacag aaetaaaeag eggggtgeae agttgtegec aatgetgetc agaaaacace 2336 
cggaagtgga tttgttaaag etgtgcttte aggttaaaec aagacacgte agaacgaaca 2396 
occactetgc ageteeagtc gecatataaa aatgccagtt etaeagagtg gaagtgceta 2456 
gctttgatet ttgtatatat cttgagaatg ttcaaactga gataatatta aaaacacatg 2516 
aegtaaattg cctttgtggg tettteaaga aatgatggga ctaataacea taagattgac 
aggaatt 



2576 
2583 



<210> 122 

«c211> 722 

<2l2> PRT 

<213> Mouse 



<400> 122 

Glu Leu Arg Asp Glu Gin Thr Pro Oly His Arg Lys Asn Pro Ser Asn 
s 10 



Gin ser ser Leu Glu ser Asp Ser Asn Tyr Pro Ser lie ser Thr Ser 
20 25 



Glu lie Gly Asp Thr Glu Asp Ala Leu Gin Gin Val Glu Glu lie Gly 
35 40 

lie Glu Lys Ala Ala Met Asp Met Thr Val Phe Leu Lys Leu Gin Lys 
50 55 60 

Arg Val Arg Glu Leu Glu Gin Glu Arg Lys Lys Leu Gin Ala Gin Leu 
65 70 

Glu Lys Gly Gin Gin Asp Ser Lys Lys Gly Gin Val Glu Gin Gin Asn 

AS n Gly Leu Asp val Asp Gin Asp Ala Asp He Ala Tyr Asn Ser Leu 
100 105 
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Lys « g « «. Leu el- Ser A. M. -y= "*» gj «» «• - 

115 120 

ftsn Glu Leu Arg Asn 01/ Val Ala Asp Gin Ala Met Gin Asp Asn Ser 
130 135 

e-,- Tl ,r Leu Leu Leu Asn Gin Leu Lys 
Thr His Ser Ser Pro Asp Ser Tyr Ser Leu ufl 

145 «0 155 

Leu Ala Asn Glu Glu Leu Glu Val Arg Lys Glu Gl. Ala Leu lie Leu 



165 



170 



A rg Thr Gin lie Met Asn Ala Asp Gin Arg Arg Leu Ser Gly I*. A.n 



ibo i 85 



Met Glu Pro Asn lie Asn Ala Arg Thr Ser Trp Pro Asn Ser Glu Lys 



195 200 



His Val Asp Gin Glu Asp Ala lie Glu Ala Tyr His Gly Val Cys Gin 
210 215 220 

Thr Asn Arg Leu Leu Glu Ala Gin Leu Gin Ala Gin ser Leu Glu His 
225 230 235 

Glu Glu Glu Val Glu His Leu Lys Ala Gin Val Glu Ala Leu Lys Glu 
245 250 

Glu Met Asp Lys Gin Gin Gin Thr Phe Cys Gin Thr Leu Leu Leu Ser 
260 265 

pro Glu Ala Gin Val Glu Phe Gly Val Gin Gin Glu lie Ser Arg Leu 
275 260 2 

Thr Asn Glu Asn Leu Asp Phe Lys Glu Leu Val Glu Lys Leu Glu Lys 
290 295 300 

Asn Glu Arg Lys Leu Lys Lys Gin Leu Lys lie Tyr Met Lys Lys val 



305 



310 



Gin Asp Leu Glu Ala Ala Gin Ala Leu Ala Gin Ser Asp Arg Arg His 



325 



His Glu Leu Thr Arg Gin Val Thr Val Gin Arg Lys Glu Lys As P Phe 

345 ■ ,tlU 



340 



Gin Gly Met Leu Glu Tyr His Lys Glu Val Glu Ala Leu Leu He Arg 
355 3 «° 365 

-39- 

PAGE 901112 * RCVD AT 5/5/2005 3:24:30 AM [Eastern Daylight Time] * SVR:DCS-CS01/1 * DNIS:2047 * CSID: * DURATION (mm-ss):3346 



' 05-05-' 05 17:47 FROM- T-330 P091/112 F-313 

WO 2004/042059 PCT/AUWMWII487 
A6 n Leu VaX Thr Asp Leu Lys Pro Gin Met Leu Leu Gly Thr Val Pro 



370 



375 



cys Leu Pro Ala Tyr lie Leu Tyr Met Cys lie Arg His Ala Asp Tyr 



385 



390 



Thr Asn Asp Asp Leu Lys Val Hi* Ser Leu Leu Ser Ser Thr lie Asn 

405 410 

Gly lie Lys Lys Val Leu Lys Lys His Asn Asp Asp Phe Glu Met Thr 
420 425 

Phe Trp Leu Ser Asn Thr Cys Arg Phe Leu His Cys Leu Lys Gin 
' 435 440 445 

T yr ser Gly Asp Glu Gly Phe Met Thr Gin Asn He Ala Lys Gin Asn 

asr 460 



ser 

43$ 



450 4 *5 



Glu His cys Leu Lys Asn Phe Asp Leu Thr Glu Tyr Arg Gin Val Leu 

470 475 



465 



ser Asp Leu Ser He Gin lie Tyr Gin Gin Leu He Lys Met Pro Glu 
485 490 

Gly Leu Leu Gin Pro Met lie Val Ser Ala Met Leu Glu Asn Glu Ser 
500 505 5 " 

ne Gin Gly Leu Ser Gly Val Arg Pro Thr Gly Tyr Arg Lys Arg Ser 



515 



520 525 



ser ser Met Val Asp Gly Glu Asn Ser Phe His Thr Val Leu Cys As P 



530 "5 540 



Gin Gly Leu Asp Pro Glu lie He Leu Gin Val Phe Lys Gin Leu Phe 

SS0 55S 



545 



Tyr Met He Asn Ala Val Thr Leu Asn Asn Leu Leu Leu Arg Lys Asp 
y 565 570 575 

Ala Cys Ser Trp Ser Thr Gly Met Gin Leu Arg Tyr Asn lie Ser Gin 



Leu 



Glu Glu Trp Leu Arg Gly Lys Asn Leu His Gin Ser Gly Ala Val 
595 600 S °S 
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Gin Thr Met Glu Pro Leu He Gin Ala Ala Gin Leu Leu Gin Leu Lys 
610 615 620 

Lys Lys Thr His Glu Asp Ala Glu Ala He Cys Ser Leu Cys Thr Ser 
625 630 635 

Leu ser Thr Gin Gin lie val Lys lie Leu Asn Leu Tyr Thr Pro Leu 
645 6 50 655 

Asn Glu Phe Glu Glu Arg Val Thr Val Ser Phe lie Arg Thr lie Gin 
660 665 670 

Ala Gin Leu Gin Glu Arg Asn Asp Pro Gin Gin Leu Leu Leu Asp Ser 
675 680 685 

Lys His Val Phe Pro Val Leu Phe Pro Tyr Asn Pro Ser Ala Leu Thr 
690 695 700 

Met Asp Ser lie His He Pro Ala Cys Leu Asn Leu Glu Phe Leu Asn 
705 710 715 

GlU Val 



<Z10> 123 

<211> 549 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> BAR gene 



<220> 

<221> CDS 

<222> (1) . ■ (549) 



<400> 123 „„„ ___ 4fl 

age cca gaa cga cgc ccg gec gac ate cgc cgt gec acc gag gcg gac 

Ser Pro Glu Arg Arg Pro Ala Asp He Arg Arg Ala Thr Glu Ala Asp 

c 10 I 5 



1 



ata cca qcg gtc cgc acc ate gtc aac cac tac ate gag aca age acg 
Mec Pro Ala val Cys Thr He Val Asn His Tyr He Glu Thr Ser Thr 

25 30 



20 



gtc aac etc cgt acc gag ccg cag gaa ccg cag gag tgg acg gac gac 
Val Asn Phe Arg Thr Glu Pro Gin Glu Pro Gin Glu Trp Thr Asp Asp 

40 45 



35 



etc gtc cgc ctg egg gag cgc tat ccc tgg etc gtc gec gag gtg gac 
Leu Val Arg Leu Arg Glu Arg Tyr Pro Trp Leu Val Ala Glu Val Asp 
50 55 60 



96 



144 



3.92 
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aoc qW q tc gcc ggc ate gcc tac gcg ggc ccc tgg aag gca cgc aac 
III Su ?al £a Sy He Ala Tyr Ala Gly Pro Trp I*> Ala Arg Asn 

70 75 



65 



tac gac tgg acg gcc g« teg acc gtg tac gtc tec ccc cgc 



Sr Lp S Thr Tla £S Ser Thr Val Tyr val Ser Pro Arg His 



85 



fM cqq ac g gga ctg ggc tec acg etc tac acc cac ctg ctg aag tec 
III Arg S llY Leu G?y Ser Thr Leu Tyr Thr His Lea Leu Lys Ser 
100 105 110 

eta gag gca cag ggc ttc aag age gtg gtc get gtc ate ggg ctg ccc 
S Glu III Gin G?y Phe Lys Ser Val Val Ala val He Gly Leu Pro 
115 120 125 

aac gac ccg age gtg cgc atg cac gag gcg etc gga tat gcc ccc cgc 
ten Mp Pro Ser Va! Arg Met His Glu Ala Leu Gly Tyr Ala Pro Arg 
130 I" I" 

aac atq ctg egg gcg gcc ggc ttc aag cac ggg aac tgg cat gac gtg 
G?y ml HI Arg III La Gly Phe Lys His Gly Asn Trp His Asp Val 
145 1^0 «S 160 

got ttc tgg cag ctg gac ttc age ctg ccg gtg ccg ccc cgt ccg gtc 
Sy Phe Trp Gin Leu Asp Phe Ser Leu Pro Val Pro Pro Arg Pro Val 
165 170 i75 

ctg ccc gtc acc gaa ate tga 
Leu Pro Val Thr Glu He 
180 



c210> 124 

<211> 182 

<212> PRT 

<213> Artificial Sequence 
<220> 

<222> BAR gene 

<400> 124 



Ser Pro Glu Arg Arg Pro Ala Asp He Arg Arg Ala Thr Glu Ala Asp 
X 5 



10 15 



Met Pro Ala Val Cys Thr He Val Asn His Tyr He Glu Thr ser Thr 
20 25 30 

Val Asn Phe Arg Thr Glu Pro Gin Glu Pro Gin Glu Trp Thr Asp Asp 
35 40 45 

Leu Val Arg Leu Arg Glu Arg Tyr Pro Trp Leu Val Ala Glu Val Asp 
50 5S 60 

Gly Glu Val Ala Gly He Ala Tyr Ala Gly Pro Trp Lys Ala Arg Asn 
65 70 75 



240 



cac 288 



336 



384 



432 



480 



52 8 



54 9 
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Ala Tyr Asp Trp Thr Ala Glu Ser Thr Val Tyr Val Ser Pro Arg His 
85 90 95 

Gin Arg Thr Gly Leu Gly Ser Thr Leu Tyr Thr His Leu Leu Lys Ser 
100 105 110 



Leu Glu Ala Gin Gly Phe Lys Ser Val Val Ala Val lie Gly Leu Pro 
115 120 125 



Asn Asp Pro Ser Val Arg Met His Glu Ala Leu Gly Tyr Ala Pro Arg 
130 135 1^0 



Gly Met Leu Arg Ala Ala Gly Phe Lys His Gly Asn Trp His Asp Val 

145 150 155 160 

Gly Phe Trp Gin Leu Asp Phe Ser Leu Pro Val Pro Pro Arg Pro Val 
165 170 175 



Leu Pro Val Thr Glu lie 
180 



<210> 125 

<211> 366 

<212> DNA 

<213> Human 



<220> 

<221> CDS 

<222> (1) - - (363) 

<400> 125 

atg gga aaa ggt gtg aaa tec ccg egg gag aag tea cgc tat gag acc 

Met Gly Lys Gly Val Lys Ser Pro Gly Glu Lys Ser Arg Tyr Glu Thr 

1 5 10 15 

tea ctg aat ctg acc acc aag cgc ttc ctg gag ctg ctg age cac teg 
ser Leu Asn Leu Thr Thr Lys Arg Phe Leu Glu Leu Leu Ser His Ser 
20 25 30 



cag aag egg cgc ate tat gac ate acc aac gtc ctt gag ggc ate cag 
Gin Lys Arg Arg lie Tyr Asp He Thr Asn Val Leu Glu Gly He Gin 
50 55 60 

etc att gec aag aag tec aag aac cac ate cag ngg ctg ggc age cac 
Leu He Ala Lys Lys Ser Lys Asn His He Gin Trp Leu Gly Ser His 
65 70 75 80 

acc aca gtg ggc gtc ggc gga egg ctt gag ggg ttg acc cag gac etc 
Thr Thr Val Gly Val Gly Gly Arg Leu Glu Gly Leu Thr Gin Asp Leu 
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4 8 



96 



get gac ggc gtc gtc gac ctg aac tgg get gec gag gtg ctg aag gtg 144 
Ala Asp Gly Val Val Asp Leu Asn Trp Ala Ala Glu Val Leu Lys Val 
35 40 45 



192 



240 



2 88 
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8S 90 95 

cga cag ctg cag gag age gag cag cag ctg gac cac ctg atg aat ate 336 

Arg Gin Leu Gin Glu Ser Glu Gin Gin Leu Asp His Leu Met Asn He 

100 105 HO 



tgt act acg cag ctg cgc ctg etc tec tga 
Cys Thr Thr Gin lieu Arg Leu Leu Ser 
115 120 



366 



<210> 


126 


<211> 


121 


<212> 


PRT 


<213> 


Human 


<400> 


126 



Met Gly Lys Gly Val Lys Ser Pro Gly Glu Lys Ser Arg Tyr Glu Thr 
1 5 10 15 



Ser Leu Asn Leu Thr Thr Lys Arg Phe Leu Glu Leu Leu Ser His Ser 
20 25 30 



Ala Asp Gly val Val Asp Leu Asn Trp Ala Ala Glu Val Leu Lys Val 
35 40 45 



Gin Lys Arg Arg He Tyr Asp He Thr Asn Val Leu Glu Gly He Gin 
50 55 60 



Leu He Ala Lys Lys Ser Lys Asn His lie Gin Trp Leu Gly Ser His 
65 70 75 SO 



Thr Thr Val Gly Val Gly Gly Arg Leu Glu Gly Leu Thr Gin A9p Leu 
85 90 95 



Arg Gin Leu Gin Glu Ser Glu Gin Gin Leu Asp His Leu Met Asn He 
100 105 no 



Cys Thr Thr Gin Leu Arg Leu Leu Ser 
115 120 



-44- 
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